Non-viral vesicle vector for cardiac specific gene delivery

ABSTRACT

The invention is a non-viral vesicle vector for the delivery of nucleic acid to various cardiac cell types. The vesicle vector contains the hepatitis B envelope protein wherein at least part of the liver targeting sequence is deleted and replaced with a specific cardiac cell targeting sequence. The targeting sequence may be derived from viruses that have the natural tropism desired (e.g. adenovirus type 5 knob protein for cardiomyocyte delivery) or mammalian sequences (e.g. endothelin-1 for vascular endothelial cell delivery). The vesicle vector contains an expression construct for the expression of therapeutic genes in cardiac tissues.

CROSS-REFERENCES TO RELATED APPLICATIONS

[0001] This application claims the benefit of priority of U.S. provisional application Serial No. 60/287,423 filed Apr. 30, 2001 which is incorporated herein by reference in its entirety.

SEQUENCE LISTING

[0002] A sequence listing is submitted herewith under 35 C.F.R. §1.821 and is incorporated herein by reference.

BACKGROUND OF THE INVENTION

[0003] The utility of gene delivery vectors for gene therapy is limited by the nonselective nature in which the vectors, either non-viral or viral, interact with the cell surface, resulting in transduction of numerous cell types in addition to the target cells. Much effort has been devoted to understand the mechanisms of viral targeting to different cell types. The information derived from these studies is now being exploited to select viruses that have the desired natural tropism or to modify viral targeting signals to redirect viruses to the cell type of choice.

[0004] Methods to redirect the targeting of viral vectors include the use of bi-specific antibodies (Wickham et al, 1996). Adenoviral vectors with a fiber protein modified to contain the antigenic FLAG peptide sequence were prepared. A bispecific antibody comprised of a monoclonal antibody to the FLAG epitope and a monoclonal antibody to a cell type specific surface antigen (e.g. α_(v) integrin for targeting endothelial and smooth muscle cells) was bound to the adenovirus to target it to α_(v) integrin expressing cells. Transduction of specific cell types was greatly increased by the use of this method, however the method is sufficiently cumbersome to make it impractical for use in gene therapy. Such a method also requires knowledge of a specific cell surface receptor to which a portion of the bispecific antibody can be directed.

[0005] Subsequently, viral particles were modified directly to contain targeting sequences such as integrin binding peptide sequences (Dmitriev et al., 1998). The H1 loop of the adenovirus fiber knob was modified to contain the RGD integrin binding sequence. The modified vector was then able to transduce primary tumor cells with greater efficiency. Phage display screening methods have been used to select peptides for incorporation into adenovirus fiber protein to allow for targeting to cell types with unknown cell surface receptors (Nicklin et al., 2000). Insertion of sequences to promote specific targeting can be coupled with mutation of natural targeting sequences to increase the tissue specificity of the virus.

[0006] Although the methods above have helped to resolves some tissue specificity problems, gene therapy protocols are also limited by the absence of a high efficiency gene delivery vector that can delivery large constructs, be produced on a sufficient scale for use and is safe for administration to humans. Adenovirus, adeno-associated virus (AAV) and the majority of retroviruses have a small capacity for heterologous nucleic acid sequence (i.e. 4.5 Kb for AAV and approximately 8 Kb for conventional adenovirus) which is insufficient for a number of applications. Preparation of high titer viral stocks is difficult and laborious. Viral vectors also contain a portion of the viral genome which is not ideal for safety considerations, especially with the use of retroviruses.

[0007] The development of good gene delivery vectors would be a boon to a number of medical fields, including cardiology. Cardiovascular gene therapy could be used for the treatment of ischemic diseases by delivery of angiogenic factors, restenosis after angioplasty and stent implantation, and to treat atherogenesis and thrombogenesis. Heart failure could be prevented or treated by the transfer of genes that would correct problems with calcium handling. The major limitation of cardiac gene therapy is the lack of a good gene delivery vector with high gene transfer efficiency and minimal side effects.

[0008] Striated muscle tissue is considered amenable for gene transfer because of natural tissue selectivity if adenovirus and AAV, because of high expression of Coxsackie-adenovirus receptor (CAR), the adenovirus receptor, in this tissue. Type 2 AAV associates with membrane associated heparin sulfate proteoglycan (HSPG). In skeletal muscle, type 2 AAV predominantly transduces slow myofibers, which are very similar to cardiac muscle cells. Several experimental attempts of gene transfer therapy to the myocardium have been undertaken. Recently highly efficient gene delivery to cardiac tissue using adenovirus and AAV was recently achieved. Both viral vectors can be used to transfer genes to non-dividing cells, including rod shaped adult cardiomyocytes. However, adenovirus and AAV vectors have been found to be inefficient in transduction of vascular tissues. Attempts to deliver genes to vascular cells with retroviral and non-viral vectors have been similarly disappointing.

[0009] Studies are being undertaken to increase the specificity of non-viral vectors. A novel, liver-specific vesicle vector expressing modified surface proteins of the hepatitis B virus was recently described by Yamada et al (2001 a). The vesicles containing the hepatitis B membrane proteins are generated by the methods well known to those skilled in the art (Kuroda et al, 1992, and Yamada et al., 2001b, incorporated herein by reference). Briefly, a modified hepatitis B envelope (env) L protein, containing the pre-S1+pre-S2+S peptides, can be effectively generated in yeast by fusing the coding sequence for the chicken lysozyme signal sequence in frame to the beginning of the coding sequence for the modified env L protein (SEQ ID 1). The signal sequence directs the insertion of the proteins into the endoplasmic reticulum during translation. Protein rich vesicles bud from the endoplasmic reticulum and accumulate in the cytoplasm of the yeast cell. The vesicles are composed of lipid bilayers derived from the ER with the modified env L protein as the major protein component. Particles formed by this method are very stable and can be easily purified through repetitive cesium chloride and sucrose gradients by methods well known to those skilled in the art.

[0010] Plasmid DNA into the env L containing particles by electroporation (Yamada et al. 2001 a). Such DNA containing particles were demonstrated to facilitate entry of the DNA specifically into liver cells both in culture and upon systemic administration to nude mice. Yamada et al. (2001 a) suggested that such a vesicle vector could be used for tissue specific delivery of nucleic acid and other compounds to the any tissue by altering the tissue targeting sequence exposed on the surface of the vesicle.

SUMMARY OF THE INVENTION

[0011] The invention is the modification of the hepatitis B env L vesicle vector for delivery of genes to cardiac cells. Targeting sequences from a number of sources including targeting sequences from vectors with a natural tropism for various types of cardiac cells and sequences of endogenous ligands that bind to surface proteins present on cardiac cells. For example, two peptide sequences from coat protein of viral vectors that are widely used in muscle gene delivery can be used for delivery to cardiomyocytes. The C-terminal knob-domain of Ad5 fiber protein is known to bind CAR, which is highly expressed on the surface of cardiomyocytes. The loop IV region of VP3 capsid protein of AAV binds to heparin sulfate proteoglycans on the surface of cardiomyocytes. For targeting cells throughout the circulatory system including cardiomyocytes, smooth muscle cells and endothelial cells, α_(v) integrin binding RGD peptides or poly-lysine motifs (pK7) are incorporated. To selectively target vascular endothelial cells, peptides including vascular endothelial growth factor (VEGF), platelet-endothelial cell adhesion molecule-1 (PCAM-1), angiopoietin-1 and -2, any of the family of ephrins (e.g. ephrin-A1), L-seelctin, CD34, Lfa-1 and Mac-1 are used. Various vasotropic peptides including angiotensin 1 and 11 and endothelin-1 are used to broadly target the cardiovascular system. The targeting sequence is selected to infect the species of interest.

[0012] The unique feature of the invention is the use of the tissue selective non-viral vector to deliver gene expression cassettes, proteins or other agents to cardiovascular tissues. Targeting the non-viral gene delivery vesicles via peptides known to specifically bind the surface of specific cardiac and vascular cell types provides a method to direct highly efficient gene delivery. The safety of such non-viral vectors in humans is already established as the vesicle vector has long been used for the hepatitis B virus vaccine. This is in contrast to viral gene transfer vectors which include much of the genome of the virus from which they were derived. Production of the vesicles is relatively easy as compared to production of high titer virus stocks. The vesicles are highly stable and can be produced in large quantities making them ideal for gene therapy.

[0013] The invention is a method for the treatment of cardiovascular disease comprising the administration of the non-viral vector of the invention preferably containing a gene expression construct, possibly in conjunction with other agents. The expression construct may be single or double stranded DNA containing any of a number of promoters including, but not limited to general (e.g. cytomegalovirus, Rous sarcoma virus) and tissue specific (e.g.myosin light chain 2v, cardiac ankirin repeat, ANF and BNP) promoters. The construct may contain additional regulatory elements including, but not limited to enhancers, introns, poly A sequences, RNA targeting sequences. Sequences to promote replication of the plasmid including SV40 origin of replication can be included. Inverted terminal repeat (ITR) sequences from AAV can be included in the construct to promote expression cassette stability or to enhance integration into the host DNA with the AAV Rep protein. In lieu of ITR sequences, eukaryotic DNA transposon/transposases systems can be used to promote integration. The non-viral vesicle vector detailed is for delivery of therapeutic constructs to humans. However, it is well within the skill of those in the art to modify the vesicle vector of the instant invention for the use in any of a number of animals, especially mammals.

[0014] Gene therapy may be used alone to deliver growth factors (e.g. delivery of VEGF to promote angiogenesis) or to enhance expression of gene products in the cell (e.g. delivery of sarcoplasmic reticulum Ca²⁺-ATPase (SERCA) or mutated forms of phospholamban to restore Ca²⁺ normal contractility). Alternatively, gene therapy can be used in conjunction with other therapies (e.g. delivery of antisense cdc2 and antisense proliferating cell nuclear antigen (PCNA) to prevent restenosis after balloon angioplasty). As the vesicle vector contains cell specific targeting signals, it can be delivered intravenously or intra-arterially rather than by more invasive methods (e.g. direct cardiac injection).

DETAILED DESCRIPTION AND PREFERRED EMBODIMENTS

[0015] The invention is a vesicle vector for the treatment of cardiac disease comprising a natural lipid vesicle preferably produced in yeast or Sf9 insect cells containing hepatitis B env L protein modified to contain a cardiac targeting sequence in the S1 region such that the targeting sequence is exposed on the surface of the vesicle and an nucleic acid construct inside the vesicle for the expression of a therapeutic nucleotide sequence or gene for cardiac cells. The vesicles are prepared by the vaccine production method of Kuroda (1992) further refined by Yamada (2001 b). Briefly, the hepatitis B env L protein is composed of three regions: the 108- or 119-residue pre-S1 region involved in the direct interaction with hepatocytes, the 55-residue pre-S2 region associated with the polymerized albumin-mediated interaction and the major 226-residue S-protein region. Attempts to produce L protein in various eukaryotic cells had been unsuccessful, probably due to the presence of the N-terminus of the pre-S1 peptide. The coding sequence of the N-terminus of the L protein was replaced by a chicken lysosome signal sequence to direct the translocation of the N-terminus through the endoplasmic reticulum (ER). The chimeric sequence was inserted into a yeast (S. cerevisiae) expression vector and inserted into yeast using a standard transformation protocol. The chimeric L-protein was produced in abundance, up to 42% of the total yeast protein, and was determined to be properly inserted into the membrane. Vesicles budded off of the ER to form 23 nm spherical and filamentous particles containing the protein in the membrane of the vesicles. The yeast cells were disrupted with glass beads to release the vesicles. Vesicles were purified by serial rounds of discontinuous cesium and sucrose gradients. Production and purification of vesicles from insect cells would be performed in a similar method. A crude membrane fraction could be prepared as with the yeast cells, by homogenization and differential centrifugation. The fraction can be loaded onto cesium or sucrose gradients as with the yeast extract for purification of vesicles. The methods are amenable to inexpensive, large scale production of vesicles which is necessary for gene transfer. Vesicles are stable for long term storage at a low temperature but are unstable upon repeated freeze-thaw cycles.

[0016] A number of cardiac targeting sequences are available for use in the vesicle vector of the invention. The selection of a specific sequence is dependent upon the cardiac tissue to be targeted. For example, viral targeting sequences from type 5 adenovirus and AAV can be used for targeting cardiomyocytes. Vascular endothelial cells are more efficiently targeted by the use of natural peptide ligands such as vascular endothelial growth factor (VEGF), platelet-endothelial cell adhesion molecule (PCAM-1), angiopoietin-1 and -2, ephrins (such as ephrin-A1 precursor), L-selectin, CD34, LFA-1 and Mac-1. RGD peptides, preferably cyclic, have a broad specificity for a number of vascular tissues (Koivunen et al, 1995, incorporated herein by reference) as do poly-lysine motifs. Target cardiac tissues and ligands for integration into the vesicle vector of the invention are listed below. Sequences listed are human targeting sequences for delivery of agents to human tissues. It is understood that species appropriate targeting sequences can be selected to target the non-viral vesicle vector in various organisms (e.g. mouse sequences could be incorporated for delivery of agents to mouse tissues). Additionally, the full length coding sequence of the targeting protein need not be incorporated into the hepatitis B surface protein. Methods to determine the essential factors for targeting to the tissue of interest are well known (e.g. phage panning, affinity chromatography, far western blots). Such selections are routine in the art and should not be considered a limitation of the invention. Cell type Ligand broad specificity, angiotensin (SEQ ID 2) cardiovascular endothelin-1 (SEQ ID 3) endothelin-2 (SEQ ID 4) cardiomyocyte adenovirus type 5 fiber protein (SEQ ID 5, nt 385-581 or portion thereof) AAV 2 coat protein sequences, VP1 loop IV domain (SEQ ID 6, nt 416-646 or portion thereof) coxsackie virus B3, VP1 (SEQ ID 7) coxsackie virus receptor (SEQ ID 8) vascular cells RGD peptides, preferably cyclic (SEQ ID 9) polylysine motifs vascular endothelial VEGF (SEQ ID 10) cells PCAM-1 (SEQ ID 11) angiopoietin-1 (SEQ ID 12) angiopoietin-2 (SEQ ID 13) ephrins, e.g. ephrin-A1 (SEQ ID 14) L-selectin (SEQ ID 15) CD34 (SEQ ID 16) LFA-1 (SEQ ID 17) Mac-1 (SEQ ID 18)

[0017] The vesicle vectors can be used for the delivery of any nucleic acid construct, single- or double-stranded DNA or RNA, to the cardiac tissue. The nucleic acid sequence to be delivered would depend on the disease state and the tissue to which the gene is delivered. Potential therapies include long term expression of gene products to replace or enhance expression of proteins for the treatment of heart failure such as mutant forms of phospholamban (see WO 00/25804, incorporated herein by reference) SERCA-2, G-protein coupled receptors, G-protein coupled receptor modifier or -adrenergic receptor (β-AR) to increase cardiac contractility. Arrhythmia can be treated by expression of potassium channels and their associated molecules. Reperfusion injury can by treated by expression of superoxide dismutase (SOD) or nitric oxide synthase (NOS). Atherosclerosis can be treated by expression of negative cell cycle regulators or a lipoprotein receptor such as low density lipoprotein (LDL) receptor. Alternatively, antisense oligonucleotides can be produced short term to inhibit expression of cdc2 and PCNA of can be produced to inhibit restenosis after balloon angioplasty.

[0018] The construct may optionally contain additional regulatory and enhancer elements to modulate gene expression, intron and poly-A sequences to promote RNA processing and gene expression, RNA targeting sequences, AAV-ITR or eukaryotic transposon sequences to promote stabilization of expression cassettes and integration into the host genome and viral origin of replication sequences to promote amplification of the plasmid in host cells. Such sequences are well known to those skilled in the art. The number of elements that can be inserted into the nucleic acid construct as the size is not limited by the requirements of a viral genome as is the case with many gene transfer protocols.

[0019] Any of a number of promoter sequences are known to be functional in cardiac cells. These include both non-tissue specific promoters such as CMV, RSV, ubiquitin, chicken β-actin and elongation factor (EF)-1α; and tissue specific promoters such as myosin light chain 2v, CARP, ANF and BNP for cardiomyocytes; SM22 for smooth muscle cells; and Fit-1 (VEGFR-1), Flk-1 (VEGFR-2), endothelial type nitric oxide synthase (eNOS) and endothelin.

[0020] AAV-ITR sequences may be incorporated into the construct flanking all of the coding and regulatory sequences, other than any origins of replication. The AAV-ITR sequences have been demonstrated to increase the stability of transferred constructs in gene therapy protocols. Alternatively, the AAV-ITR sequences may enhance integration into the human genome at a specific site with the cooperation of the AAV-Rep protein, which may be supplied by incorporation into the vesicles with the nucleic acid construct or by expression cassettes packaged into the same vesicle.

[0021] Eukaryotic transposon sequences can be incorporated into the construct flanking all of the coding sequences and regulatory elements, similar to the AAV-ITR sequences. Transposase to promote integration may be expressed from the same expression cassette or from a separate expression cassette packaged into the same vesicle.

[0022] The coding sequence incorporated into the expression cassette or the agent to be delivered will be dependent on the disease to be treated, as discussed above. The specific contents of the non-viral vesicle vector is not a limitation of the instant invention.

[0023] In a preferred embodiment, the nucleic acid construct of the invention is introduced into the vesicles by electroporation. The nucleic acid construct is mixed thoroughly with the vesicles, brought to a final volume in water and transferred to an electroporation cuvette. Voltage and resistance vary widely depending on the size (gap length) of the cuvette and the volume of material in the cuvette. Such parameters can be readily modified by methods well known to those skilled in the art to result in maximum transfer of nucleic acid into vesicles with minimum destruction of vesicles.

[0024] Alternatively the nucleic acid may be introduced into the vesicle by fusion with nucleic acid containing liposomes by methods well known to those skilled in the art (Dzau et al, 1996). The construct of the invention is encapsulated into liposomes prepared by vortexing. Liposomes may be composed of known phospholipids and membrane components (e.g. phosphatidyl-choline, cholesterol) or of commercially available proprietary mixtures of membrane components (e.g. Lipofectamine from Gibco-BRL). Nucleic acid encapsulated in liposomes will fuse with the yeast or insect cell derived vesicles upon incubation at 37° C. for 10-30 minutes.

[0025] The nucleic acid or protein containing non-viral vesicle vectors of the invention are administered to the individual intravenously or intraarterially. To increase delivery, the vesicle vector can be administered directly adjacent to the heart. Such an application would be most common in conjunction with surgery. The individual is monitored on regular intervals for the expression of the gene products or for phenotypic recovery. The amount of the non-viral vesicle to be administered would depend on the strength of the promoter, integration sequences, number of plasmids per vesicle and a number of other considerations well know to those skilled in the art. As methods for monitoring the state of health of individuals are well known, an effective dose can be readily determined.

EXAMPLE 1

[0026] Delivery of SERCA-2 to caridomyocytes using a viral targeting sequence. The coding sequence of the S1 portion of the hepatitis B env L protein containing the chicken lysosome signal sequence was modified to contain all or part of the knob domain of adenovirus type 5 (amino acids 385-581) (SEQ ID 2) such that the cardiomyocyte binding domain is exposed on the exterior surface of the vesicle vector. Methods for modifying nucleic acid sequences are well known to those skilled in the art. The coding sequence for the modified env L protein is inserted into an appropriate expression vector and transformed into yeast. Vesicles are purified from the yeast by the method of Kuroda (1992) and Yamada (2001 b). A plasmid vector for the expression of pseudophosphorylated mutants of phospholamban or SERCA-2 in cardiomyocytes is generated containing the coding sequence for SERCA-2 driven by a CMV promoter. The coding sequence and promoter are flanked by AAV-ITR sequences. The plasmid is introduced into the vesicles by electroporation. The vesicles are administered intravenously to the patient. Cardiac function is monitored by methods well known to those skilled in the art (e.g. blood pressure, EKG) to determine the efficacy of the gene therapy protocol.

EXAMPLE 2

[0027] Delivery of VEGF to vascular endothelial cells using an integrin targeting sequence. The coding sequence of the S1 portion of the hepatitis B env L protein containing the chicken lysosome signal sequence is modified to contain any of a number of α_(v) integrin RGD binding motifs (Koivunen et al, 1995, incorporated herein by reference) such that the RGD sequence is exposed on the exterior surface of the vesicle vector. The RGD sequence preferably contains cysteine residues to allow for the formation of disulfide bonds to form a cyclic peptide (e.g. ACDCRGDCFCG) (SEQ ID 9). Methods for modifying nucleic acid sequences are well known to those skilled in the art. The coding sequence for the modified env L protein is inserted into an appropriate expression vector and transformed into yeast. Vesicles are purified from the yeast by the method of Kuroda (1992) and Yamada (2001 b). A plasmid vector for the expression of VEGF in vascular endothelial cells is generated containing the coding sequence for VEGF driven by an RSV promoter. The coding sequence and promoter are flanked by AAV-ITR sequences. The plasmid is introduced into the vesicles by electroporation. The vesicles are administered intravenously to the patient. VEGF stimulated angiogenesis is monitored using imaging methods known to those skilled in the art.

[0028] Although an exemplary embodiment of the invention has been described above by way of example only, it will be understood by those skilled in the field that modifications may be made to the disclosed embodiment without departing from the scope of the invention, which is defined by the appended claims.

REFERENCES

[0029] Dmitriev, I et al. (1998) An adenovirus vector with genetically modified fibers demonstrates expanded tropism via utilization of a coxsackievirus and adenovirus receptor-independent cell entry mechanism. J. Virol. 72:9706-9713.

[0030] Koivunen, E., et al (1995) Phage libraries displaying cyclic peptides with different ring sizes: ligand specificities of the RGD-directed integrins. Biotechnology 13:265-70.

[0031] Kuroda, S. et al (1992) Hepatitis B virus envelope L protein particles. J. Biol. Chem. 267:1953-1961.

[0032] Nicklin, S. A. (2000) Selective targeting of gene transfer to vascular endothelial cells by use of peptides isolated by phage display. Circulation 102:231-237.

[0033] Wickham, T. J. et al. (1996) Targeted adenovirus gene transfer to endothelial and smooth muscle cells using bispecific antibodies. J. Virol. 70:6831-6838.

[0034] Yamada, T. et al (2001 a) A new pinpoint gene delivery system using genetically engineered hepatitis B virus envelope L particles. Molecular Biology and New Therapeutic Strategies: Cancer Research in the 21^(st) Century. 5^(th) Joint Conference of the American Association for Cancer Research and the Japanese Cancer Association. Hawaii, USA, Feb. 12-16, 2001.

[0035] Yamada. T. et al (2001 b) Physiochemical and immunological characterization of hepatitis B virus envelope particles exclusively consisting of the entire L (pre-S1+pre-S2+S) protein. Vaccine 19:3154-3163.

[0036] Sequence Accession Numbers K02215; XM_(—)043856; NM_(—)001956; M18369; AF043303; M33854; NM_(—)001338; NP_(—)003377; M28526; AAB50557; XP_(—)034835; NM004428; X117519; M81104; Y00796; J03925 all incorporated herein by reference.

1 18 1 1286 DNA Hepatitis B virus 1 gtcgagtata aaaacaatga gatctttgtt gatcttggtt ttgtgtttct tgccattggc 60 tgctttgggt aaggttcgac aaggcatggg aggttggtct tccaaacctc gacaaggcat 120 ggggacgaat ctttctgttc ccaatcctct gggattcttt cccgatcacc agttggaccc 180 tgcgttcgga gccaactcaa acaatccaga ttgggacttc aaccccaaca aggatcaatg 240 gccagaggca aatcaggtag gagcgggagc attcgggcca gggttcaccc caccacacgg 300 cggtcttttg gggtggagcc ctcaggctca gggcatattg acaacagtgc cagcagcacc 360 tcctcctgcc tccaccaatc ggcagtcagg aagacagcct actcccatct ctccacctct 420 aagagacagt catcctcagg ccatgcagtg gaattccaca acattccacc aagctctgct 480 agatcccaga gtgaggggcc tatattttcc tgctggtggc tccagttccg gaacagtaaa 540 ccctgttccg actactgcct cacccatatc tggggaccct gcaccgaaca tggagaacac 600 aacatcagga ttcctaggac ccctgctcgt gttacaggcg gggtttttct tgttgacaag 660 aatcctcaca ataccacaga gtctagactc gtggtggact tctctcaatt ttctaggggg 720 agcacccacg tgtcctggcc aaaattcgca gtccccaacc tccaatcact caccaacctc 780 ttgtcctcca atttgtcctg gctatcgctg gatgtgtctg cggcgtttta tcatattcct 840 cttcatcctg ctgctatgcc tcatcttctt gttggttctt ctggactacc aaggtatgtt 900 gcccgtttgt cctctacttc caggaacatc aaccaccagc acggggccat gcaagacctg 960 cacgattcct gctcaaggaa cctctatgtt tccctcttgt tgctgtacaa aaccttcgga 1020 cggaaactgc acttgtattc ccatcccatc atcctgggct ttcgcaagat tcctatggga 1080 gtgggcctca gtccgtttct cctggctcag tttactagtg ccatttgttc agtggttcgt 1140 agggctttcc cccactgttt ggctttcagt tatatggatg atgtggtatt gggggccaag 1200 tctgtacaac atcttgagtc cctttttacc tctattacca attttctttt gtctttgggt 1260 atacatttaa attgaattga attgaa 1286 2 2099 DNA Homo sapiens 2 aagaagctgc cgttgttctg ggtactacag cagaagggta tgcggaagcg agcaccccag 60 tctgagatgg ctcctgccgg tgtgagcctg agggccacca tcctctgcct cctggcctgg 120 gctggcctgg ctgcaggtga ccgggtgtac atacacccct tccacctcgt catccacaat 180 gagagtacct gtgagcagct ggcaaaggcc aatgccggga agcccaaaga ccccaccttc 240 atacctgctc caattcaggc caagacatcc cctgtggatg aaaaggccct acaggaccag 300 ctggtgctag tcgctgcaaa acttgacacc gaagacaagt tgagggccgc aatggtcggg 360 atgctggcca acttcttggg cttccgtata tatggcatgc acagtgagct atggggcgtg 420 gtccatgggg ccaccgtcct ctccccaacg gctgtctttg gcaccctggc ctctctctat 480 ctgggagcct tggaccacac agctgacagg ctacaggcaa tcctgggtgt tccttggaag 540 gacaagaact gcacctcccg gctggatgcg cacaaggtcc tgtctgccct gcaggctgta 600 cagggcctgc tagtggccca gggcagggct gatagccagg cccagctgct gctgtccacg 660 gtggtgggcg tgttcacagc cccaggcctg cacctgaagc agccgtttgt gcagggcctg 720 gctctctata cccctgtggt cctcccacgc tctctggact tcacagaact ggatgttgct 780 gctgagaaga ttgacaggtt catgcaggct gtgacaggat ggaagactgg ctgctccctg 840 atgggagcca gtgtggacag caccctggct ttcaacacct acgtccactt ccaagggaag 900 atgaagggct tctccctgct ggccgagccc caggagttct gggtggacaa cagcacctca 960 gtgtctgttc ccatgctctc tggcatgggc accttccagc actggagtga catccaggac 1020 aacttctcgg tgactcaagt gcccttcact gagagcgcct gcctgctgct gatccagcct 1080 cactatgcct ctgacctgga caaggtggag ggtctcactt tccagcaaaa ctccctcaac 1140 tggatgaaga aactgtctcc ccggaccatc cacctgacca tgccccaact ggtgctgcaa 1200 ggatcttatg acctgcagga cctgctcgcc caggctgagc tgcccgccat tctgcacacc 1260 gagctgaacc tgcaaaaatt gagcaatgac cgcatcaggg tgggggaggt gctgaacagc 1320 attttttttg agcttgaagc ggatgagaga gagcccacag agtctaccca acagcttaac 1380 aagcctgagg tcttggaggt gaccctgaac cgcccattcc tgtttgctgt gtatgatcaa 1440 agcgccactg ccctgcactt cctgggccgc gtggccaacc cgctgagcac agcatgaggc 1500 cagggcccca gaacacagtg cctggcaagg cctctgcccc tggcctttga ggcaaaggcc 1560 agcagcagat aacaaccccg gacaaatcag cgatgtgtca cccccagtct cccacctttt 1620 cttctaatga gtcgactttg agctggaaag cagccgtttc tccttggtct aagtgtgctg 1680 catggagtga gcagtagaag cctgcagcgg cacaaatgca cctcccagtt tgctgggttt 1740 attttagaga atgggggtgg ggaggcaaga accagtgttt agcgcgggac tactgttcca 1800 aaaagaattc caaccgacca gcttgtttgt gaaacaaaaa agtgttccct tttcaagttg 1860 agaacaaaaa ttgggtttta aaattaaagt atacattttt gcattgcctt cggtttgtat 1920 ttagtgtctt gaatgtaaga acatgacctc cgtgtagtgt ctgtaatacc ttagtttttt 1980 ccacagatgc ttgtgatttt tgaacaatac gtgaaagatg caagcacctg aatttctgtt 2040 tgaatgcgga acaatagctg gttatttctc ccttgtgtta gtaataaacg tcttgccac 2099 3 1250 DNA Homo sapiens 3 ggagctgttt acccccactc taataggggt tcaatataaa aagccggcag agagctgtcc 60 aagtcagacg cgcctctgca tctgcgccag gcgaacgggt cctgcgcctc ctgcagtccc 120 agctctccac cgccgcgtgc gcctgcagac gctccgctcg ctgccttctc tcctggcagg 180 cgctgccttt tctccccgtt aaaagggcac ttgggctgaa ggatcgcttt gagatctgag 240 gaacccgcag cgctttgagg gacctgaagc tgtttttctt cgttttcctt tgggttcagt 300 ttgaacggga ggtttttgat cccttttttt cagaatggat tatttgctca tgattttctc 360 tctgctgttt gtggcttgcc aaggagctcc agaaacagca gtcttaggcg ctgagctcag 420 cgcggtgggt gagaacggcg gggagaaacc cactcccagt ccaccctggc ggctccgccg 480 gtccaagcgc tgctcctgct cgtccctgat ggataaagag tgtgtctact tctgccacct 540 ggacatcatt tgggtcaaca ctcccgagca cgttgttccg tatggacttg gaagccctag 600 gtccaagaga gccttggaga atttacttcc cacaaaggca acagaccgtg aaaatagatg 660 ccaatgtgct agccaaaaag acaagaagtg ctggaatttt tgccaagcag gaaaagaact 720 cagggctgaa gacattatgg agaaagactg gaataatcat aagaaaggaa aagactgttc 780 caagcttggg aaaaagtgta tttatcagca gttagtgaga ggaagaaaaa tcagaagaag 840 ttcagaggaa cacctaagac aaaccaggtc ggagaccatg agaaacagcg tcaaatcatc 900 ttttcatgat cccaagctga aaggcaagcc ctccagagag cgttatgtga cccacaaccg 960 agcacattgg tgacagacct tcggggcctg tctgaagcca tagcctccac ggagagccct 1020 gtggccgact ctgcactctc caccctggct gggatcagag caggagcatc ctctgctggt 1080 tcctgactgg caaaggacca gcgtcctcgt tcaaaacatt ccaagaaagg ttaaggagtt 1140 cccccaacca tcttcactgg cttccatcag tggtaactgc tttggtctct tctttcatct 1200 ggggatgaca atggacctct cagcagaaac acacagtcac attcgaattc 1250 4 1240 DNA Homo sapiens 4 ccagcttaat agcaggacgc tggcaacagg cactccctgc tccagtccag cctggcgctc 60 caccgccgct atggtctccg tgcctaccac ctggtgctcc gttgcgctag ccctgctcgt 120 ggccctgcat gaagggaagg gccaggctgc tgccaccctg gagcagccag cgtcctcatc 180 tcatgcccaa ggcacccacc ttcggcttcg ccgttgctcc tgcagctcct ggctcgacaa 240 ggagtgcgtc tacttctgcc acttggacat catctgggtg aacactcctg aacagacagc 300 tccttacggc ctgggaaacc cgccaagacg ccggcgccgc tccctgccaa ggcgctgtca 360 gtgctccagt gccagggacc ccgcctgtgc caccttctgc cttcgaaggc cctggactga 420 agccggggca gtcccaagcc ggaagtcccc tgcagacgtg ttccagactg gcaagacagg 480 ggccactaca ggagagcttc tccaaaggct gagggacatt tccacagtca agagcctctt 540 tgccaagcga caacaggagg ccatgcggga gcctcggtcc acacattcca ggtggaggaa 600 gagatagtgt cgtgagctgg aggaacattg ggaaggaagc ccgcggggag agaggaggag 660 agaagtggcc agggcttgtg gactctctgc ctgcttcctg gaccggggcc ttggtcccag 720 acagctggac ccatttgcca ggattggcac aagctccctg gtgagggagc ctcgtccaag 780 gcagttctgt gtcctcgcac tgcccaggga agccctcggc ctccagactg cggagcagcc 840 tccagtgctg gctgctggcc cacagctctg ctggaagaac tgcatgggga gtacattcat 900 ctggaggctg cgtcctgagg agtgtcctgt ctgctgggct acaaaccagg agcaaccgtg 960 cagccacgaa cacgcatgcc tcagccagcc ctggagactg gatggctccc ctgaggctgg 1020 catcctggct ggctgtgtcc tctccagctt tccctcccca gagttcttgc accctcattc 1080 cctcgggacc ctcccagtga gaagggcctg ctctgctttt cctgtctgta tataacttat 1140 ttgccctaag aactttgaga atcccaatta tttattttaa tgtatttttt agaccctcta 1200 tttacctgcg aacttgtgtt tataataaat gaggaaacat 1240 5 2530 DNA Adenovirus type 5 5 gttaacttgc accagtgcaa aaggggtatc ttttgtctgg taaagcaggc caaagtcacc 60 tacgacagta ataccaccgg acaccgcctt agctacaagt tgccaaccaa gcgtcagaaa 120 ttggtggtca tggtgggaga aaagcccatt accataactc agcactcggt agaaaccgaa 180 ggctgcattc actcaccttg tcaaggacct gaggatctct gcacccttat taagaccctg 240 tgcggtctca aagatcttat tccctttaac taataaaaaa aaataataaa gcatcactta 300 cttaaaatca gttagcaaat ttctgtccag tttattcagc agcacctcct tgccctcctc 360 ccagctctgg tattgcagct tcctcctggc tgcaaacttt ctccacaatc taaatggaat 420 gtcagtttcc tcctgttcct gtccatccgc acccactatc ttcatgttgt tgcagatgaa 480 gcgcgcaaga ccgtctgaag ataccttcaa ccccgtgtat ccatatgaca cggaaaccgg 540 tcctccaact gtgccttttc ttactcctcc ctttgtatcc cccaatgggt ttcaagagag 600 tccccctggg gtactctctt tgcgcctatc cgaacctcta gttacctcca atggcatgct 660 tgcgctcaaa atgggcaacg gcctctctct ggacgaggcc ggcaacctta cctcccaaaa 720 tgtaaccact gtgagcccac ctctcaaaaa aaccaagtca aacataaacc tggaaatatc 780 tgcacccctc acagttacct cagaagccct aactgtggct gccgccgcac ctctaatggt 840 cgcgggcaac acactcacca tgcaatcaca ggccccgcta accgtgcacg actccaaact 900 tagcattgcc acccaaggac ccctcacagt gtcagaagga aagctagccc tgcaaacatc 960 aggccccctc accaccaccg atagcagtac ccttactatc actgcctcac cccctctaac 1020 tactgccact ggtagcttgg gcattgactt gaaagagccc atttatacac aaaatggaaa 1080 actaggacta aagtacgggg ctcctttgca tgtaacagac gacctaaaca ctttgaccgt 1140 agcaactggt ccaggtgtga ctattaataa tacttccttg caaactaaag ttactggagc 1200 cttgggtttt gattcacaag gcaatatgca acttaatgta gcaggaggac taaggattga 1260 ttctcaaaac agacgcctta tacttgatgt tagttatccg tttgatgctc aaaaccaact 1320 aaatctaaga ctaggacagg gccctctttt tataaactca gcccacaact tggatattaa 1380 ctacaacaaa ggcctttact tgtttacagc ttcaaacaat tccaaaaagc ttgaggttaa 1440 cctaagcact gccaaggggt tgatgtttga cgctacagcc atagccatta atgcaggaga 1500 tgggcttgaa tttggttcac ctaatgcacc aaacacaaat cccctcaaaa caaaaattgg 1560 ccatggccta gaatttgatt caaacaaggc tatggttcct aaactaggaa ctggccttag 1620 ttttgacagc acaggtgcca ttacagtagg aaacaaaaat aatgataagc taactttgtg 1680 gaccacacca gctccatctc ctaactgtag actaaatgca gagaaagatg ctaaactcac 1740 tttggtctta acaaaatgtg gcagtcaaat acttgctaca gtttcagttt tggctgttaa 1800 aggcagtttg gctccaatat ctggaacagt tcaaagtgct catcttatta taagatttga 1860 cgaaaatgga gtgctactaa acaattcctt cctggaccca gaatattgga actttagaaa 1920 tggagatctt actgaaggca cagcctatac aaacgctgtt ggatttatgc ctaacctatc 1980 agcttatcca aaatctcacg gtaaaactgc caaaagtaac attgtcagtc aagtttactt 2040 aaacggagac aaaactaaac ctgtaacact aaccattaca ctaaacggta cacaggaaac 2100 aggagacaca actccaagtg catactctat gtcattttca tgggactggt ctggccacaa 2160 ctacattaat gaaatatttg ccacatcctc ttacactttt tcatacattg cccaagaata 2220 aagaatcgtt tgtgttatgt ttcaacgtgt ttatttttca attgcagaaa atttcaagtc 2280 atttttcatt cagtagtata gccccaccac cacatagctt atacagatca ccgtacctta 2340 atcaaactca cagaacccta gtattcaacc tgccacctcc ctcccaacac acagagtaca 2400 cagtcctttc tccccggctg gccttaaaaa gcatcatatc atgggtaaca gacatattct 2460 taggtgttat attccacacg gtttcctgtc gagccaaacg ctcatcagtg atattaataa 2520 actccccggg 2530 6 4679 DNA adeno-associated virus 2 6 ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120 gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 180 ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg 360 accttgacga gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 420 aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 480 ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 540 cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 600 tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 660 aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 720 tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 780 ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 840 agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 900 cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 960 cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1020 aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1080 atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1140 tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1200 ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1260 ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1320 ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1380 acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1440 tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1500 tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1560 ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1620 ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1680 tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1740 aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1800 gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1860 agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1920 gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1980 atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg 2040 tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc 2100 atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt 2160 tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat 2220 cttccagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa 2280 cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg 2340 cttcctgggt acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac 2400 gaggcagacg ccgcggccct cgagcacgac aaagcctacg accggcagct cgacagcgga 2460 gacaacccgt acctcaagta caaccacgcc gacgcggagt ttcaggagcg ccttaaagaa 2520 gatacgtctt ttgggggcaa cctcggacga gcagtcttcc aggcgaaaaa gagggttctt 2580 gaacctctgg gcctggttga ggaacctgtt aagacggctc cgggaaaaaa gaggccggta 2640 gagcactctc ctgtggagcc agactcctcc tcgggaaccg gaaaggcggg ccagcagcct 2700 gcaagaaaaa gattgaattt tggtcagact ggagacgcag actcagtacc tgacccccag 2760 cctctcggac agccaccagc agccccctct ggtctgggaa ctaatacgat ggctacaggc 2820 agtggcgcac caatggcaga caataacgag ggcgccgacg gagtgggtaa ttcctcggga 2880 aattggcatt gcgattccac atggatgggc gacagagtca tcaccaccag cacccgaacc 2940 tgggccctgc ccacctacaa caaccacctc tacaaacaaa tttccagcca atcaggagcc 3000 tcgaacgaca atcactactt tggctacagc accccttggg ggtattttga cttcaacaga 3060 ttccactgcc acttttcacc acgtgactgg caaagactca tcaacaacaa ctggggattc 3120 cgacccaaga gactcaactt caagctcttt aacattcaag tcaaagaggt cacgcagaat 3180 gacggtacga cgacgattgc caataacctt accagcacgg ttcaggtgtt tactgactcg 3240 gagtaccagc tcccgtacgt cctcggctcg gcgcatcaag gatgcctccc gccgttccca 3300 gcagacgtct tcatggtgcc acagtatgga tacctcaccc tgaacaacgg gagtcaggca 3360 gtaggacgct cttcatttta ctgcctggag tactttcctt ctcagatgct gcgtaccgga 3420 aacaacttta ccttcagcta cacttttgag gacgttcctt tccacagcag ctacgctcac 3480 agccagagtc tggaccgtct catgaatcct ctcatcgacc agtacctgta ttacttgagc 3540 agaacaaaca ctccaagtgg aaccaccacg cagtcaaggc ttcagttttc tcaggccgga 3600 gcgagtgaca ttcgggacca gtctaggaac tggcttcctg gaccctgtta ccgccagcag 3660 cgagtatcaa agacatctgc ggataacaac aacagtgaat actcgtggac tggagctacc 3720 aagtaccacc tcaatggcag agactctctg gtgaatccgg gcccggccat ggcaagccac 3780 aaggacgatg aagaaaagtt ttttcctcag agcggggttc tcatctttgg gaagcaaggc 3840 tcagagaaaa caaatgtgga cattgaaaag gtcatgatta cagacgaaga ggaaatcagg 3900 acaaccaatc ccgtggctac ggagcagtat ggttctgtat ctaccaacct ccagagaggc 3960 aacagacaag cagctaccgc agatgtcaac acacaaggcg ttcttccagg catggtctgg 4020 caggacagag atgtgtacct tcaggggccc atctgggcaa agattccaca cacggacgga 4080 cattttcacc cctctcccct catgggtgga ttcggactta aacaccctcc tccacagatt 4140 ctcatcaaga acaccccggt acctgcgaat ccttcgacca ccttcagtgc ggcaaagttt 4200 gcttccttca tcacacagta ctccacggga caggtcagcg tggagatcga gtgggagctg 4260 cagaaggaaa acagcaaacg ctggaatccc gaaattcagt acacttccaa ctacaacaag 4320 tctgttaatg tggactttac tgtggacact aatggcgtgt attcagagcc tcgccccatt 4380 ggcaccagat acctgactcg taatctgtaa ttgcttgtta atcaataaac cgtttaattc 4440 gtttcagttg aactttggtc tctgcgtatt tctttcttat ctagtttcca tggctacgta 4500 gataagtagc atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc 4560 actccctctc tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc 4620 ccgggctttg cccgggcggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaa 4679 7 7399 DNA Coxsackievirus B3 7 ttaaaacagc ctgtgggttg atcccaccca caggcccatt gggcgctagc actctggtat 60 cacggtacct ttgtgcgcct gttttatacc ccctccccca actgtaactt agaagtaaca 120 cacaccgatc aacagtcagc gtggcacacc agccacgttt tgatcaagca cttctgttac 180 cccggactga gtatcaatag actgctcacg cggttgaagg agaaagcgtt cgttatccgg 240 ccaactactt cgaaaaacct agtaacaccg tggaagttgc agagtgtttc gctcagcact 300 accccagtgt agatcaggtc gatgagtcac cgcattcccc acgggcgacc gtggcggtgg 360 ctgcgttggc ggcctgccca tggggaaacc catgggacgc tctaatacag acatggtgcg 420 aagagtctat tgagctagtt ggtagtcctc cggcccctga atgcggctaa tcctaactgc 480 ggagcacaca ccctcaagcc agagggcagt gtgtcgtaac gggcaactct gcagcggaac 540 cgactacttt gggtgtccgt gtttcatttt attcctatac tggctgctta tggtgacaat 600 tgagagatcg ttaccatata gctattggat tggccatccg gtgactaata gagctattat 660 atatcccttt gttgggttta taccacttag cttgaaagag gttaaaacat tacaattcat 720 tgttaagttg aatacagcaa aatgggagct caagtatcaa cgcaaaagac tggggcacat 780 gagaccaggc tgaatgctag cggcaattcc atcattcact acacaaatat taattattac 840 aaggatgccg catccaactc agccaatcgg caggatttca ctcaagaccc gggcaagttc 900 acagaaccag tgaaagatat catgattaaa tcactaccag ctctcaactc ccccacagta 960 gaggagtgcg gatacagtga cagggcgaga tcaatcacat taggtaactc caccataacg 1020 actcaggaat gcgccaacgt ggtggtgggc tatggagtat ggccagatta tctaaaggat 1080 agtgaggcaa cagcagagga ccaaccgacc caaccagacg ttgccacatg taggttctat 1140 acccttgact ctgtgcaatg gcagaaaacc tcaccaggat ggtggtggaa gctgcccgat 1200 gctttgtcga acttaggact gtttgggcag aacatgcagt accactactt aggccgaact 1260 gggtataccg tacatgtgca gtgcaatgca tctaagttcc accaaggatg cttgctagta 1320 gtgtgtgtac cggaagctga gatgggttgc gcaacgctag acaacacccc atccagtgca 1380 gaattgctgg ggggcgatac ggcaaaggag tttgcggaca aaccggtcgc atccgggtcc 1440 aacaagttgg tacagagggt ggtgtataat gcaggcatgg gggtgggtgt tggaaacctc 1500 accattttcc cccaccaatg gatcaaccta cgcaccaata atagtgctac aattgtgatg 1560 ccatacacca acagtgtacc tatggataac atgtttaggc ataacaacgt caccctaatg 1620 gttatcccat ttgtaccgct agattactgc cctgggtcca ccacgtacgt cccaattacg 1680 gtcacgatag ccccaatgtg tgccgagtac aatgggttac gtttagcagg gcaccagggc 1740 ttaccaacca tgaatactcc ggggagctgt caatttctga catcagacga cttccaatca 1800 ccatccgcca tgccgcaata tgacgtcaca ccagagatga ggatacctgg tgaggtgaaa 1860 aacttgatgg aaatagctga ggttgactca gttgtcccag tccaaaatgt tggagagaag 1920 gtcaactcta tggaagcata ccagatacct gtgagatcca acgaaggatc tggaacgcaa 1980 gtattcggct ttccactgca accagggtac tcgagtgttt ttagtcggac gctcctagga 2040 gagatcttga actattatac acattggtca ggcagcataa agcttacgtt tatgttctgt 2100 ggttcggcca tggctactgg aaaattcctt ttggcatact caccaccagg tgctggagct 2160 cctacaaaaa gggttgatgc tatgcttggt actcatgtaa tttgggacgt ggggctacaa 2220 tcaagttgcg tgctgtgtat accctggata agccaaacac actaccggtt tgttgcttca 2280 gatgagtata ccgcaggggg ttttattacg tgctggtatc aaacaaacat agtggtccca 2340 gcggatgccc aaagctcctg ttacatcatg tgtttcgtgt cagcatgcaa tgacttctct 2400 gtcaggctat tgaaggacac tcctttcatt tcgcagcaaa actttttcca gggcccagtg 2460 gaagacgcga taacagccgc tatagggaga gttgcggata ccgtgggtac agggccaacc 2520 aactcagaag ctataccagc actcactgct gctgagacgg gtcacacgtc acaagtagtg 2580 ccgggtgaca ctatgcagac acgccacgtt aagaactacc attcaaggtc cgagtcaacc 2640 atagagaact tcctatgtag gtcagcatgc gtgtacttta cggagtataa aaactcaggt 2700 gccaagcggt atgctgaatg ggtattaaca ccacgacaag cagcacaact taggagaaag 2760 ctagaattct ttacctacgt ccggttcgac ctggagctga cgtttgtcat aacaagtact 2820 caacagccct caaccacaca gaaccaagat gcacagatcc taacacacca aattatgtat 2880 gtaccaccag gtggacctgt accagataaa gttgattcat acgtgtggca aacatctacg 2940 aatcccagtg tgttttggac cgagggaaac gccccgccgc gcatgtccat accgtttttg 3000 agcattggca acgcctattc aaatttctat gacggatggt ctgaattttc caggaacgga 3060 gtttacggca tcaacacgct aaacaacatg ggcacgctat atgcaagaca tgtcaacgct 3120 ggaagcacgg gtccaataaa aagcaccatt agaatctact tcaaaccgaa gcatgtcaaa 3180 gcgtggatac ctagaccacc tagactctgc caatacgaga aggcaaagaa cgtgaacttc 3240 caacccagcg gagttaccac tactaggcaa agcatcacta caatgacaaa tacgggcgca 3300 tttggacaac aatcaggggc agtgtatgtg gggaactaca gggtggtaaa tagacatcta 3360 gctaccagtg ctgactggca aaactgtgtg tgggaaagtt acaacagaga cctcttagtg 3420 agcacgacca cagcacatgg atgtgatatt atagccagat gtcagtgcac aacgggagtg 3480 tacttttgtg cgtccaaaaa caagcactac ccaatttcgt ttgaaggacc aggtctagta 3540 gaggtccaag agagtgaata ctaccccagg agataccaat cccatgtgct tttagcagct 3600 ggattttccg aaccaggtga ctgtggcggt atcctaaggt gtgagcatgg tgtcattggc 3660 attgtgacca tggggggtga aggcgtggtc ggctttgcag acatccgtga tctcctgtgg 3720 ctggaagatg atgcaatgga acagggagtg aaggactatg tggaacagct tggaaatgca 3780 ttcggctccg gctttactaa ccaaatatgt gagcaagtca acctcctgaa agaatcacta 3840 gtgggtcaag actccatctt agagaaatct ctaaaagcct tagttaagat aatatcagcc 3900 ttagtaattg tggtgaggaa ccacgatgac ctgatcactg tgactgccac actagccctt 3960 atcggttgta cctcgtcccc gtggcggtgg ctcaaacaga aggtgtcaca atattacgga 4020 atccctatgg ctgaacgcca aaacaatagc tggcttaaga aatttactga aatgacaaat 4080 gcttgcaagg gtatggaatg gatagctgtc aaaattcaga aattcattga atggctcaaa 4140 gtaaaaattt tgccagaggt cagagaaaaa cacgagttcc tgaacagact taaacaactc 4200 cccttattag aaagtcagat cgccacaatc gagcagagcg cgccatccca aagtgaccag 4260 gaacaattat tttccaatgt ccaatacttt gcccactatt gcagaaagta cgctcccctc 4320 tacgcagctg aagcaaagag ggtgttctcc cttgagaaga agatgagcaa ttacatacag 4380 ttcaagtcca aatgccgtat tgaacctgta tgtttgctcc tgcacgggag ccctggtgcc 4440 ggcaagtcgg tggcaacaaa cttaattgga aggtcgcttg ctgagaaact caacagctca 4500 gtgtactcac taccgccaga cccagatcac ttcgacggat acaaacagca ggccgtggtg 4560 attatggacg atctatgcca gaatcctgat gggaaagacg tctccttgtt ctgccaaatg 4620 gtttccagtg tagattttgt accacccatg gctgccctag aagagaaagg cattctgttc 4680 acctcaccgt ttgtcttggc atcgaccaat gcaggatcta ttaatgctcc aaccgtgtca 4740 gatagcagag ccttggcaag gagatttcac tttgacatga acatcgaggt tatttccatg 4800 tacagtcaga atggcaagat aaacatgccc atgtcagtca agacttgtga cgatgagtgt 4860 tgcccggtca attttaaaaa gtgctgccct cttgtgtgtg ggaaggctat acaattcatt 4920 gatagaagaa cacaggtcag atactctcta gacatgctag tcaccgagat gtttagggag 4980 tacaatcata gacatagcgt ggggaccacg cttgaggcac tgttccaggg accaccagta 5040 tacagagaga tcaaaattag cgttgcacca gagacaccac caccgcccgc cattgcggac 5100 ctgctcaaat cggtagacag tgaggctgtg agggagtact gcaaagaaaa aggatggttg 5160 gttcctgaga tcaactccac cctccaaatt gagaaacatg tcagtcgggc tttcatttgc 5220 ttacaggcat tgaccacatt tgtgtcagtg gctggaatca tatatataat atataagctc 5280 tttgcgggtt ttcaaggtgc ttatacagga gtgcccaacc agaagcccag agtgcctacc 5340 ctgaggcaag caaaagtgca aggccctgcc tttgagttcg ccgtcgcaat gatgaaaagg 5400 aactcaagca cggtgaaaac tgaatatggc gagtttacca tgctgggcat ctatgacagg 5460 tgggccgttt tgccacgcca cgccaaacct gggccaacca tcttgatgaa tgatcaagag 5520 gttggtgtgc tagatgccaa ggagctagta gacaaggacg gcaccaactt agaactgaca 5580 ctactcaaat tgaaccggaa tgagaagttc agagacatca gaggcttctt agccaaggag 5640 gaagtggagg ttaatgaggc agtgctagca attaacacca gcaagtttcc caacatgtac 5700 attccagtag gacaggtcac agaatacggc ttcctaaacc taggtggcac acccaccaag 5760 agaatgctta tgtacaactt ccccacaaga gcaggccagt gtggtggagt gctcatgtcc 5820 accggcaagg tactgggtat ccatgttggt ggaaatggcc atcagggctt ctcagcagca 5880 ctcctcaaac actacttcaa tgatgagcaa ggtgaaatag aatttattga gagctcaaag 5940 gacgccgggt ttccagtcat caacacacca agtaaaacaa agttggagcc tagtgttttc 6000 caccaggtct ttgaggggaa caaagaacca gcagtactca ggagtgggga tccacgtctc 6060 aaggccaatt ttgaagaggc tatattttcc aagtatatag gaaatgtcaa cacacacgtg 6120 gatgagtaca tgctggaagc agtggaccac tacgcaggcc aactagccac cctagatatc 6180 agcactgaac caatgaaact ggaggacgca gtgtacggta ccgagggtct tgaggcgctt 6240 gatctaacaa cgagtgccgg ttacccatat gttgcactgg gtatcaagaa gagggacatc 6300 ctctctaaga agactaagga cctaacaaag ttaaaggaat gtatggacaa gtatggcctg 6360 aacctaccaa tggtgactta tgtaaaagat gagctcaggt ccatagagaa ggtagcgaaa 6420 ggaaagtcta ggctgattga ggcgtccagt ttgaatgatt cagtggcgat gagacagaca 6480 tttggtaatc tgtacaaaac tttccaccta aacccagggg ttgtgactgg tagtgctgtt 6540 gggtgtgacc cagacctctt ttggagcaag ataccagtga tgttagatgg acatctcata 6600 gcatttgatt actctgggta cgatgctagc ttaagccctg tctggtttgc ttgcctaaaa 6660 atgttacttg agaagcttgg atacacgcac aaagagacaa actacattga ctacttgtgc 6720 aactcccatc acctgtacag ggataaacat tactttgtga ggggtggcat gccctcggga 6780 tgttctggta ccagtatttt caactcaatg attaacaata tcataattag gacactaatg 6840 ctaaaagtgt acaaagggat tgacttggac caattcagga tgatcgcata tggtgatgat 6900 gtgatcgcat cgtacccatg gcctatagat gcatctttac tcgctgaagc tggtaagggt 6960 tacgggctga tcatgacacc agcagataag ggagagtgct ttaacgaagt tacctggacc 7020 aacgccactt tcctaaagag gtattttaga gcagatgaac agtacccctt cctggtgcat 7080 cctgttatgc ccatgaaaga catacacgaa tcaattagat ggaccaagga tccaaagaac 7140 acccaagatc acgtgcgctc actgtgtcta ttagcttggc ataacgggga gcacgaatat 7200 gaggagttca tccgtaaaat tagaagcgtc ccagtcggac gttgtttgac cctccccgcg 7260 ttttcaactc tacgcaggaa gtggttggac tccttttaga ttagagacaa tttgaaataa 7320 tttagattgg cttaacccta ctgtgctaac cgaaccagat aacggtacag taggggtaaa 7380 ttctccgcat tcggtgcgg 7399 8 2537 DNA Homo sapiens 8 cgccgccgcg agccagtcgg gagcgcgcga ggcgcgggga gcctgggacc aggagcgaga 60 gccgcctacc tgcagccgcc gcccacggca cggcagccac catggcgctc ctgctgtgct 120 tcgtgctcct gtgcggagta gtggatttcg ccagaagttt gagtatcact actcctgaag 180 agatgattga aaaagccaaa ggggaaactg cctatctgcc gtgcaaattt acgcttagtc 240 ccgaagacca gggaccgctg gacatcgagt ggctgatatc accagctgat aatcagaagg 300 tggatcaagt gattatttta tattctggag acaaaattta tgatgactac tatccagatc 360 tgaaaggccg agtacatttt acgagtaatg atctcaaatc tggtgatgca tcaataaatg 420 taacgaattt acaactgtca gatattggca catatcagtg caaagtgaaa aaagctcctg 480 gtgttgcaaa taagaagatt catctggtag ttcttgttaa gccttcaggt gcgagatgtt 540 acgttgatgg atctgaagaa attggaagtg actttaagat aaaatgtgaa ccaaaagaag 600 gttcacttcc attacagtat gagtggcaaa aattgtctga ctcacagaaa atgcccactt 660 catggttagc agaaatgact tcatctgtta tatctgtaaa aaatgcctct tctgagtact 720 ctgggacata cagctgtaca gtcagaaaca gagtgggctc tgatcagtgc ctgttgcgtc 780 taaacgttgt ccctccttca aataaagctg gactaattgc aggagccatt ataggaactt 840 tgcttgctct agcgctcatt ggtcttatca tcttttgctg tcgtaaaaag cgcagagaag 900 aaaaatatga aaaggaagtt catcacgata tcagggaaga tgtgccacct ccaaagagcc 960 gtacgtccac tgccagaagc tacatcggca gtaatcattc atccctgggg tccatgtctc 1020 cttccaacat ggaaggatat tccaagactc agtataacca agtaccaagt gaagactttg 1080 aacgcactcc tcagagtccg actctcccac ctgctaaggt agctgcccct aatctaagtc 1140 gaatgggtgc gattcctgtg atgattccag cacagagcaa ggatgggtct atagtataga 1200 gcctccatat gtctcatctg tgctctccgt gttcctttcc tttttttgat atatgaaaac 1260 ctattctggt ctaaattgtg ttactagcct caaaatacat caaaaaataa gttaatcagg 1320 aactgtacgg aatatatttt taaaaatttt tgtttggtta tatcgaaata gttacaggca 1380 ctaaagttag taaagaaaag tttaccatct gaaaaagctg gattttcttt aagaggttga 1440 ttataaagtt ttctaaattt atcagtacct aagtaagatg tagcgctttg aatatgaaat 1500 cataggtgaa gacatgggtg aacttacttg cataccaagt tgatacttga ataaccatct 1560 gaaagtggta cttgatcatt tttaccatta tttttaggat gtgtatttca tttatttatg 1620 gcccaccagt ctcccccaaa ttagtacaga aatatccatg acaaaattac ttacgtatgt 1680 ttgtacttgg ttttacagct cctttgaaaa ctctgtgttt ggaatatctc taaaaacata 1740 gaaaacacta cagtggttta gaaattacta attttacttc taagtcattc ataaaccttg 1800 tctatgaaat gacttcttaa atatttagtt gatagactgc tacaggtaat agggacttag 1860 caagctcttt tatatgctaa aggagcatct atcagattaa gttagaacat ttgctgtcag 1920 ccacatattg agatgacact aggtgcaata gcagggatag attttgttgg tgagtagtct 1980 catgccttga gatctgtggt ggtcttcaaa atggtggcca gccagatcaa ggatgtagta 2040 tctcatagtt cccaggtgat atttttctta ttagaaaaat attataactc atttgttgtt 2100 tgacacttat agattgaaat ttcctaattt attctaaatt ttaagtggtt ctttggttcc 2160 agtgctttat gttgttgttg tttttggatg gtgttacata ttatatgttc tagaaacatg 2220 taatcctaaa tttaccctct tgaatataat ccctggatga tattttttat cataaatgca 2280 gaataatcaa atacatttta agcaagttaa gtgtcctcca tcaattctgt attccagact 2340 tgggaggatg tacagttgct gttgtgtgat caaacatgtc tctgtgtagt tccagcaaat 2400 caagctgagc tttgaaaaag tttgtcttag ttttgtgaag gtgatttatt cttaaaaaaa 2460 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2520 aaaaaaaaaa aaaaaaa 2537 9 11 PRT synthetic sequence 9 Ala Cys Asp Cys Arg Gly Asp Cys Phe Cys Gly 1 5 10 10 1181 DNA Homo sapiens 10 ggcacgaggg cgatgcgggc gcccccggcg ggcggccccg gcgggcacca tgagccctct 60 gctccgccgc ctgctgctcg ccgcactcct gcagctggcc cccgcccagg cccctgtctc 120 ccagcctgat gcccctggcc accagaggaa agtggtgtca tggatagatg tgtatactcg 180 cgctacctgc cagccccggg aggtggtggt gcccttgact gtggagctca tgggcaccgt 240 ggccaaacag ctggtgccca gctgcgtgac tgtgcagcgc tgtggtggct gctgccctga 300 cgatggcctg gagtgtgtgc ccactgggca gcaccaagtc cggatgcaga tcctcatgat 360 ccggtacccg agcagtcagc tgggggagat gtccctggaa gaacacagcc agtgtgaatg 420 cagacctaaa aaaaaggaca gtgctgtgaa gccagacagg gctgccactc cccaccaccg 480 tccccagccc cgttctgttc cgggctggga ctctgccccc ggagcaccct ccccagctga 540 catcacccat cccactccag ccccaggccc ctctgcccac gctgcaccca gcaccaccag 600 cgccctgacc cccggacctg ccgctgccgc tgccgacgcc gcagcttcct ccgttgccaa 660 gggcggggct tagagctcaa cccagacacc tgcaggtgcc ggaagctgcg aaggtgacac 720 atggcttttc agactcagca gggtgacttg cctcagaggc tatatcccag tgggggaaca 780 aagaggagcc tggtaaaaaa cagccaagcc cccaagacct cagcccaggc agaagctgct 840 ctaggacctg ggcctctcag agggctcttc tgccatccct tgtctccctg aggccatcat 900 caaacaggac agagttggaa gaggagactg ggaggcagca agaggggtca cataccagct 960 caggggagaa tggagtactg tctcagtttc taaccactct gtgcaagtaa gcatcttaca 1020 actggctctt cctcccctca ctaagaagac ccaaacctct gcataatggg atttgggctt 1080 tggtacaaga actgtgaccc ccaaccctga taaaagagat ggaaggaaaa aaaaaaaaaa 1140 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 1181 11 2557 DNA Homo sapiens 11 gaattccggg agaagtgacc agagcaattt ctgcttttca cagggcgggt ttctcaacgg 60 tgacttgtgg gcagtgcctt ctgctgagcg agtcatggcc cgaaggcaga actaactgtg 120 cctgcagtct tcactctcag gatgcagccg aggtgggccc aaggggccac gatgtggctt 180 ggagtcctgc tgacccttct gctctgttca agccttgagg gtcaagaaaa ctctttcaca 240 atcaacagtg ttgacatgaa gagcctgccg gactggacgg tgcaaaatgg gaagaacctg 300 accctgcagt gcttcgcgga tgtcagcacc acctctcacg tcaagcctca gcaccagatg 360 ctgttctata aggatgacgt gctgttttac aacatctcct ccatgaagag cacagagagt 420 tattttattc ctgaagtccg gatctatgac tcagggacat ataaatgtac tgtgattgtg 480 aacaacaaag agaaaaccac tgcagagtac cagctgttgg tggaaggagt gcccagtccc 540 agggtgacac tggacaagaa agaggccatc caaggtggga tcgtgagggt caactgttct 600 gtcccagagg aaaaggcccc aatacacttc acaattgaaa aacttgaact aaatgaaaaa 660 atggtcaagc tgaaaagaga gaagaattct cgagaccaga attttgtgat actggaattc 720 cccgttgagg aacaggaccg cgttttatcc ttccgatgtc aagctaggat catttctggg 780 atccatatgc agacctcaga atctaccaag agtgaactgg tcaccgtgac ggaatccttc 840 tctacaccca agttccacat cagccccacc ggaatgatca tggaaggagc tcagctccac 900 attaagtgca ccattcaagt gactcacctg gcccaggagt ttccagaaat cataattcag 960 aaggacaagg cgattgtggc ccacaacaga catggcaaca aggctgtgta ctcagtcatg 1020 gccatggtgg agcacagtgg caactacacg tgcaaagtgg agtccagccg catatccaag 1080 gtcagcagca tcgtggtcaa cataacagaa ctattttcca agcccgaact ggaatcttcc 1140 ttcacacatc tggaccaagg tgaaagactg aacctgtcct gctccatccc aggagcacct 1200 ccagccaact tcaccatcca gaaggaagat acgattgtgt cacagactca agatttcacc 1260 aagatagcct caaagtcgga cagtgggacg tatatctgca ctgcaggtat tgacaaagtg 1320 gtcaagaaaa gcaacacagt ccagatagtc gtatgtgaaa tgctctccca gcccaggatt 1380 tcttatgatg cccagtttga ggtcataaaa ggacagacca tcgaagtccg ttgcgaatcg 1440 atcagtggaa ctttgcctat ttcttaccaa cttttaaaaa caagtaaagt tttggagaat 1500 agtaccaaga actcaaatga tcctgcggta ttcaaagaca accccactga agacgtcgaa 1560 taccagtgtg ttgcagataa ttgccattcc catgccaaaa tgttaagtga ggttctgagg 1620 gtgaaggtga tagccccggt ggatgaggtc cagatttcta tcctgtcaag taaggtggtg 1680 gagtctggag aggacattgt gctgcaatgt gctgtgaatg aaggatctgg tcccatcacc 1740 tataagtttt acagagaaaa agagggcaaa cccttctatc aaatgacctc aaatgccacc 1800 caggcatttt ggaccaagca gaaggctagc aaggaacagg agggagagta ttactgcaca 1860 gccttcaaca gagccaacca cgcctccagt gtccccagaa gcaaaatact gacagtcaga 1920 gtcattcttg ccccatggaa gaaaggactt attgcagtgg ttatcatcgg agtgatcatt 1980 gctctcttga tcattgcggc caaatgttat tttctgagga aagccaaggc caagcagatg 2040 ccagtggaaa tgtccaggcc agcagtacca cttctgaact ccaacaacga gaaaatgtca 2100 gatcccaata tggaagctaa cagtcattac ggtcacaatg acgatgtcag aaaccatgca 2160 atgaaaccaa taaatgataa taaagagcct ctgaactcag acgtgcagta cacggaagtt 2220 caagtgtcct cagctgagtc tcacaaagat ctaggaaaga aggacacaga gacagtgtac 2280 agtgaagtcc ggaaagctgt ccctgatgcc gtggaaagca gatactctag aacggaaggc 2340 tcccttgatg gaacttagac agcaaggcca gatgcacatc cctggaagga catccatgtt 2400 ccgagaagaa cagataatcc ctgtatttca agacctctgt gcacttattt atgaacctgc 2460 cctgctccca cagaacacag caattcctca ggctaagctg ccggttctta aatccatcct 2520 gctaagttaa tgttgggtag aaagagatac agagggg 2557 12 498 PRT Homo sapiens 12 Met Thr Val Phe Leu Ser Phe Ala Phe Leu Ala Ala Ile Leu Thr His 1 5 10 15 Ile Gly Cys Ser Asn Gln Arg Arg Ser Pro Glu Asn Ser Gly Arg Arg 20 25 30 Tyr Asn Arg Ile Gln His Gly Gln Cys Ala Tyr Thr Phe Ile Leu Pro 35 40 45 Glu His Asp Gly Asn Cys Arg Glu Ser Thr Thr Asp Gln Tyr Asn Thr 50 55 60 Asn Ala Leu Gln Arg Asp Ala Pro His Val Glu Pro Asp Phe Ser Ser 65 70 75 80 Gln Lys Leu Gln His Leu Glu His Val Met Glu Asn Tyr Thr Gln Trp 85 90 95 Leu Gln Lys Leu Glu Asn Tyr Ile Val Glu Asn Met Lys Ser Glu Met 100 105 110 Ala Gln Ile Gln Gln Asn Ala Val Gln Asn His Thr Ala Thr Met Leu 115 120 125 Glu Ile Gly Thr Ser Leu Leu Ser Gln Thr Ala Glu Gln Thr Arg Lys 130 135 140 Leu Thr Asp Val Glu Thr Gln Val Leu Asn Gln Thr Ser Arg Leu Glu 145 150 155 160 Ile Gln Leu Leu Glu Asn Ser Leu Ser Thr Tyr Lys Leu Glu Lys Gln 165 170 175 Leu Leu Gln Gln Thr Asn Glu Ile Leu Lys Ile His Glu Lys Asn Ser 180 185 190 Leu Leu Glu His Lys Ile Leu Glu Met Glu Gly Lys His Lys Glu Glu 195 200 205 Leu Asp Thr Leu Lys Glu Glu Lys Glu Asn Leu Gln Gly Leu Val Thr 210 215 220 Arg Gln Thr Tyr Ile Ile Gln Glu Leu Glu Lys Gln Leu Asn Arg Ala 225 230 235 240 Thr Thr Asn Asn Ser Val Leu Gln Lys Gln Gln Leu Glu Leu Met Asp 245 250 255 Thr Val His Asn Leu Val Asn Leu Cys Thr Lys Glu Gly Val Leu Leu 260 265 270 Lys Gly Gly Lys Arg Glu Glu Glu Lys Pro Phe Arg Asp Cys Ala Asp 275 280 285 Val Tyr Gln Ala Gly Phe Asn Lys Ser Gly Ile Tyr Thr Ile Tyr Ile 290 295 300 Asn Asn Met Pro Glu Pro Lys Lys Val Phe Cys Asn Met Asp Val Asn 305 310 315 320 Gly Gly Gly Trp Thr Val Ile Gln His Arg Glu Asp Gly Ser Leu Asp 325 330 335 Phe Gln Arg Gly Trp Lys Glu Tyr Lys Met Gly Phe Gly Asn Pro Ser 340 345 350 Gly Glu Tyr Trp Leu Gly Asn Glu Phe Ile Phe Ala Ile Thr Ser Gln 355 360 365 Arg Gln Tyr Met Leu Arg Ile Glu Leu Met Asp Trp Glu Gly Asn Arg 370 375 380 Ala Tyr Ser Gln Tyr Asp Arg Phe His Ile Gly Asn Glu Lys Gln Asn 385 390 395 400 Tyr Arg Leu Tyr Leu Lys Gly His Thr Gly Thr Ala Gly Lys Gln Ser 405 410 415 Ser Leu Ile Leu His Gly Ala Asp Phe Ser Thr Lys Asp Ala Asp Asn 420 425 430 Asp Asn Cys Met Cys Lys Cys Ala Leu Met Leu Thr Gly Gly Trp Trp 435 440 445 Phe Asp Ala Cys Gly Pro Ser Asn Leu Asn Gly Met Phe Tyr Thr Ala 450 455 460 Gly Gln Asn His Gly Lys Leu Asn Gly Ile Lys Trp His Tyr Phe Lys 465 470 475 480 Gly Pro Ser Tyr Ser Leu Arg Ser Thr Thr Met Met Ile Arg Pro Leu 485 490 495 Asp Phe 13 496 PRT Homo sapiens 13 Met Trp Gln Ile Val Phe Phe Thr Leu Ser Cys Asp Leu Val Leu Ala 1 5 10 15 Ala Ala Tyr Asn Asn Phe Arg Lys Ser Met Asp Ser Ile Gly Lys Lys 20 25 30 Gln Tyr Gln Val Gln His Gly Ser Cys Ser Tyr Thr Phe Leu Leu Pro 35 40 45 Glu Met Asp Asn Cys Arg Ser Ser Ser Ser Pro Tyr Val Ser Asn Ala 50 55 60 Val Gln Arg Asp Ala Pro Leu Glu Tyr Asp Asp Ser Val Gln Arg Leu 65 70 75 80 Gln Val Leu Glu Asn Ile Met Glu Asn Asn Thr Gln Trp Leu Met Lys 85 90 95 Leu Glu Asn Tyr Ile Gln Asp Asn Met Lys Lys Glu Met Val Glu Ile 100 105 110 Gln Gln Asn Ala Val Gln Asn Gln Thr Ala Val Met Ile Glu Ile Gly 115 120 125 Thr Asn Leu Leu Asn Gln Thr Ala Glu Gln Thr Arg Lys Leu Thr Asp 130 135 140 Val Glu Ala Gln Val Leu Asn Gln Thr Thr Arg Leu Glu Leu Gln Leu 145 150 155 160 Leu Glu His Ser Leu Ser Thr Asn Lys Leu Glu Lys Gln Ile Leu Asp 165 170 175 Gln Thr Ser Glu Ile Asn Lys Leu Gln Asp Lys Asn Ser Phe Leu Glu 180 185 190 Lys Lys Val Leu Ala Met Glu Asp Lys His Ile Ile Gln Leu Gln Ser 195 200 205 Ile Lys Glu Glu Lys Asp Gln Leu Gln Val Leu Val Ser Lys Gln Asn 210 215 220 Ser Ile Ile Glu Glu Leu Glu Lys Lys Ile Val Thr Ala Thr Val Asn 225 230 235 240 Asn Ser Val Leu Gln Lys Gln Gln His Asp Leu Met Glu Thr Val Asn 245 250 255 Asn Leu Leu Thr Met Met Ser Thr Ser Asn Ser Ala Lys Asp Pro Thr 260 265 270 Val Ala Lys Glu Glu Gln Ile Ser Phe Arg Asp Cys Ala Glu Val Phe 275 280 285 Lys Ser Gly His Thr Thr Asn Gly Ile Tyr Thr Leu Thr Phe Pro Asn 290 295 300 Ser Thr Glu Glu Ile Lys Ala Tyr Cys Asp Met Glu Ala Gly Gly Gly 305 310 315 320 Gly Trp Thr Ile Ile Gln Arg Arg Glu Asp Gly Ser Val Asp Phe Gln 325 330 335 Arg Thr Trp Lys Glu Tyr Lys Val Gly Phe Gly Asn Pro Ser Gly Glu 340 345 350 Tyr Trp Leu Gly Asn Glu Phe Val Ser Gln Leu Thr Asn Gln Gln Arg 355 360 365 Tyr Val Leu Lys Ile His Leu Lys Asp Trp Glu Gly Asn Glu Ala Tyr 370 375 380 Ser Leu Tyr Glu His Phe Tyr Leu Ser Ser Glu Glu Leu Asn Tyr Arg 385 390 395 400 Ile His Leu Lys Gly Leu Thr Gly Thr Ala Gly Lys Ile Ser Ser Ile 405 410 415 Ser Gln Pro Gly Asn Asp Phe Ser Thr Lys Asp Gly Asp Asn Asp Lys 420 425 430 Cys Ile Cys Lys Cys Ser Gln Met Leu Thr Gly Gly Trp Trp Phe Asp 435 440 445 Ala Cys Gly Pro Ser Asn Leu Asn Gly Met Tyr Tyr Pro Gln Arg Gln 450 455 460 Asn Thr Asn Lys Phe Asn Gly Ile Lys Trp Tyr Tyr Trp Lys Gly Ser 465 470 475 480 Gly Tyr Ser Leu Lys Ala Thr Thr Met Met Ile Arg Pro Ala Asp Phe 485 490 495 14 1480 DNA Homo sapiens 14 gcggagaaag ccagtgggaa cccagaccca taggagaccc gcgtccccgc tcggcctggc 60 caggccccgc gctatggagt tcctctgggc ccctctcttg ggtctgtgct gcagtctggc 120 cgctgctgat cgccacaccg tcttctggaa cagttcaaat cccaagttcc ggaatgagga 180 ctacaccata catgtgcagc tgaatgacta cgtggacatc atctgtccgc actatgaaga 240 tcactctgtg gcagacgctg ccatggagca gtacatactg tacctggtgg agcatgagga 300 gtaccagctg tgccagcccc agtccaagga ccaagtccgc tggcagtgca accggcccag 360 tgccaagcat ggcccggaga agctgtctga gaagttccag cgcttcacac ctttcaccct 420 gggcaaggag ttcaaagaag gacacagcta ctactacatc tccaaaccca tccaccagca 480 tgaagaccgc tgcttgaggt tgaaggtgac tgtcagtggc aaaatcactc acagtcctca 540 ggcccatgtc aatccacagg agaagagact tgcagcagat gacccagagg tgcgggttct 600 acatagcatc ggtcacagtg ctgccccacg cctcttccca cttgcctgga ctgtgctgct 660 ccttccactt ctgctgctgc aaaccccgtg aaggtgtatg ccacacctgg ccttaaagag 720 ggacaggctg aagagaggga caggcactcc aaacctgtct tggggccact ttcagagccc 780 ccagccctgg gaaccactcc caccacaggc ataagctatc acctagcagc ctcaaaacgg 840 gtcagtatta aggttttcaa ccggaaggag gccaaccagc ccgacagtgc catccccacc 900 ttcacctcgg agggacggag aaagaagtgg agacagtcct ttcccaccat tcctgccttt 960 aagccaaaga aacaagctgt gcaggcatgg tcccttaagg cacagtggga gctgagctgg 1020 aaggggccac gtggatgggc aaagcttgtc aaagatgccc cctccaggag agagccagga 1080 tgcccagatg aactgactga aggaaaagca agaaacagtt tcttgcttgg aagccaggta 1140 caggagaggc agcatgcttg ggctgaccca gcatctccca gcaagacctc atctgtggag 1200 ctgccacaga gaagtttgta gccaggtact gcattctctc ccatcctggg gcagcactcc 1260 ccagagctgt gccagcaggg gggctgtgcc aacctgttct tagagtgtag ctgtaagggc 1320 agtgcccatg tgtacattct gcctagagtg tagcctaaag ggcagggccc acgtgtatag 1380 tatctgtata taagttgctg tgtgtctgtc ctgatttcta caactggagt ttttttatac 1440 aatgttcttt gtctcaaaat aaagcaatgt gttttttcgg 1480 15 2323 DNA Homo sapiens 15 ctccctttgg gcaaggacct gagacccttg tgctaagtca agaggctcaa tgggctgcag 60 aagaactaga gaaggaccaa gcaaagccat gatatttcca tggaaatgtc agagcaccca 120 gagggactta tggaacatct tcaagttgtg ggggtggaca atgctctgtt gtgatttcct 180 ggcacatcat ggaaccgact gctggactta ccattattct gaaaaaccca tgaactggca 240 aagggctaga agattctgcc gagacaatta cacagattta gttgccatac aaaacaaggc 300 ggaaattgag tatctggaga agactctgcc tttcagtcgt tcttactact ggataggaat 360 ccggaagata ggaggaatat ggacgtgggt gggaaccaac aaatctctca ctgaagaagc 420 agagaactgg ggagatggtg agcccaacaa caagaagaac aaggaggact gcgtggagat 480 ctatatcaag agaaacaaag atgcaggcaa atggaacgat gacgcctgcc acaaactaaa 540 ggcagccctc tgttacacag cttcttgcca gccctggtca tgcagtggcc atggagaatg 600 tgtagaaatc atcaataatt acacctgcaa ctgtgatgtg gggtactatg ggccccagtg 660 tcagtttgtg attcagtgtg agcctttgga ggccccagag ctgggtacca tggactgtac 720 tcactctttg ggaaacttca gcttcagctc acagtgtgcc ttcagctgct ctgaaggaac 780 aaacttaact gggattgaag aaaccacctg tggaccattt ggaaactggt catctccaga 840 accaacctgt caagtgattc agtgtgagcc tctatcagca ccagatttgg ggatcatgaa 900 ctgtagccat cccctggcca gcttcagctt tacctctgca tgtaccttca tctgctcaga 960 aggaactgag ttaattggga agaagaaaac catttgtgaa tcatctggaa tctggtcaaa 1020 tcctagtcca atatgtcaaa aattggacaa aagtttctca atgattaagg agggtgatta 1080 taaccccctc ttcattccag tggcagtcat ggttactgca ttctctgggt tggcatttat 1140 catttggctg gcaaggagat taaaaaaagg caagaaatcc aagagaagta tgaatgaccc 1200 atattaaatc gcccttggtg aaagaaaatt cttggaatac taaaaatcat gagatccttt 1260 aaatccttcc atgaaacgtt ttgtgtggtg gcacctccta cgtcaaacat gaagtgtgtt 1320 tccttcagtg catctgggaa gatttctacc tgaccaacag ttccttcagc ttccatttcg 1380 cccctcattt atccctcaac ccccagccca caggtgttta tacagctcag ctttttgtct 1440 tttctgagga gaaacaaata agaccataaa gggaaaggat tcatgtggaa tataaagatg 1500 gctgactttg ctctttcttg actcttgttt tcagtttcaa ttcagtgctg tacttgatga 1560 cagacacttc taaatgaagt gcaaatttga tacatatgtg aatatggact cagttttctt 1620 gcagatcaaa tttcacgtcg tcttctgtat actgtggagg tacactctta tagaaagttc 1680 aaaaagtcta cgctctcctt tctttctaac tccagtgaag taatggggtc ctgctcaagt 1740 tgaaagagtc ctatttgcac tgtagcctcg ccgtctgtga attggaccat cctatttaac 1800 tggcttcagc ctccccacct tcttcagcca cctctctttt tcagttggct gacttccaca 1860 cctagcatct catgagtgcc aagcaaaagg agagaagaga gaaatagcct gcgctgtttt 1920 ttagtttggg ggttttgctg tttcctttta tgagacccat tcctatttct tatagtcaat 1980 gtttctttta tcacgatatt attagtaaga aaacatcact gaaatgctag ctgcaagtga 2040 catctctttg atgtcatatg gaagagttaa aacaggtgga gaaattcctt gattcacaat 2100 gaaatgctct cctttcccct gcccccagac cttttatccg acttacctag attctacata 2160 ttctttaaat ttcatctcag gcctccctca accccaccac ttcttttata actagtcctt 2220 tactaatcca acccatgatg agctcctctt cctggcttct tactgaaagg ttaccctgta 2280 acatgcaatt ttgcatttga ataaagcctg ctttttaagt gtt 2323 16 2615 DNA Homo sapiens 16 ccttttttgg cctcgacggc ggcaacccag cctccctcct aacgccctcc gcctttggga 60 ccaaccaggg gagctcaagt tagtagcagc caaggagagg cgctgccttg ccaagactaa 120 aaagggaggg gagaagagag gaaaaaagca agaatccccc acccctctcc cgggcggagg 180 gggcgggaag agcgcgtcct ggccaagccg agtagtgtct tccactcggt gcgtctctct 240 aggagccgcg cgggaaggat gctggtccgc aggggcgcgc gcgagggccc aggatgccgc 300 ggggctggac cgcgctttgc ttgctgagtt tgctgccttc tgggttcatg agtcttgaca 360 acaacggtac tgctacccca gagttaccta cccagggaac attttcaaat gtttctacaa 420 atgtatccta ccaagaaact acaacaccta gtacccttgg aagtaccagc ctgcaccctg 480 tgtctcaaca tggcaatgag gccacaacaa acatcacaga aacgacagtc aaattcacat 540 ctacctctgt gataacctca gtttatggaa acacaaactc ttctgtccag tcacagacct 600 ctgtaatcag cacagtgttc accaccccag ccaacgtttc aactccagag acaaccttga 660 agcctagcct gtcacctgga aatgtttcag acctttcaac cactagcact agccttgcaa 720 catctcccac taaaccctat acatcatctt ctcctatcct aagtgacatc aaggcagaaa 780 tcaaatgttc aggcatcaga gaagtgaaat tgactcaggg catctgcctg gagcaaaata 840 agacctccag ctgtgcggag tttaagaagg acaggggaga gggcctggcc cgagtgctgt 900 gtggggagga gcaggctgat gctgatgctg gggcccaggt atgctccctg ctccttgccc 960 agtctgaggt gaggcctcag tgtctactgc tggtcttggc caacagaaca gaaatttcca 1020 gcaaactcca acttatgaaa aagcaccaat ctgacctgaa aaagctgggg atcctagatt 1080 tcactgagca agatgttgca agccaccaga gctattccca aaagaccctg attgcactgg 1140 tcacctcggg agccctgctg gctgtcttgg gcatcactgg ctatttcctg atgaatcgcc 1200 gcagctggag ccccacagga gaaaggctgg gcgaagaccc ttattacacg gaaaacggtg 1260 gaggccaggg ctatagctca ggacctggga cctcccctga ggctcaggga aaggccagtg 1320 tgaaccgagg ggctcagaaa aacgggaccg gccaggccac ctccagaaac ggccattcag 1380 caagacaaca cgtggtggct gataccgaat tgtgactcgg ctaggtgggg caaggctggg 1440 cagtgtccga gagagcaccc ctctctgcat ctgaccacgt gctaccccca tgctggaggt 1500 gacatctctt acgcccaacc cttccccact gcacacacct cagaggctgt tcttggggcc 1560 ctacaccttg aggagggggc aggtaaactc ctgtccttta cacattcggc tccctggagc 1620 cagactctgg tcttctttgg gtaaacgtgt gacgggggaa agccaaggtc tggagaagct 1680 cccaggaaca atcgatggcc ttgcagcact cacacaggac ccccttcccc taccccctcc 1740 tctctgccgc aatacaggaa cccccagggg aaagatgagc ttttctaggc tacaattttc 1800 tcccaggaag ctttgatttt taccgtttct tccctgtatt ttctttctct actttgagga 1860 aaccaaagta accttttgca cctgctctct tgtaatgata tagccagaaa aacgtgttgc 1920 cttgaaccac ttccctcatc tctcctccaa gacactgtgg acttggtcac cagctcctcc 1980 cttgttctct aagttccact gagctccatg tgccccctct accatttgca gagtcctgca 2040 cagttttctg gctggagcct agaacaggcc tcccaagttt taggacaaac agctcagttc 2100 tagtctctct ggggccacac agaaactctt tttgggctcc tttttctccc tctggatcaa 2160 agtaggcagg accatgggac caggtcttgg agctgagcct ctcacctgta ctcttccgaa 2220 aaatcctctt cctctgaggc tggatcctag ccttatcctc tgatctccat ggcttcctcc 2280 tccctcctgc cgactcctgg gttgagctgt tgcctcagtc ccccaacaga tgcttttctg 2340 tctctgcctc cctcaccctg agccccttcc ttgctctgca cccccatatg gtcatagccc 2400 agatcagctc ctaaccctta tcaccagctg cctcttctgt gggtgaccca ggtccttgtt 2460 tgctgttgat ttctttccag aggggttgag cagggatcct ggtttcaatg acggttggaa 2520 atagaaattt ccagagaaga gagtattggg tagatatttt ttctgaatac aaagtgatgt 2580 gtttaaatac tgcaattaaa gtgatactga aacac 2615 17 5133 DNA Homo sapiens 17 cctctttcac cctgtctagg ttgccagcaa atcccacggg cctcctgacg ctgcccctgg 60 ggccacaggt ccctcgagtg ctggaaggat gaaggattcc tgcatcactg tgatggccat 120 ggcgctgctg tctgggttct ttttcttcgc gccggcctcg agctacaacc tggacgtgcg 180 gggcgcgcgg agcttctccc caccgcgcgc cgggaggcac tttggatacc gcgtcctgca 240 ggtcggaaac ggggtcatcg tgggagctcc aggggagggg aacagcacag gaagcctcta 300 tcagtgccag tcgggcacag gacactgcct gccagtcacc ctgagaggtt ccaactatac 360 ctccaagtac ttgggaatga ccttggcaac agaccccaca gatggaagca ttttggcctg 420 tgaccctggg ctgtctcgaa cgtgtgacca gaacacctat ctgagtggcc tgtgttacct 480 cttccgccag aatctgcagg gtcccatgct gcaggggcgc cctggttttc aggaatgtat 540 caagggcaac gtagacctgg tatttctgtt tgatggttcg atgagcttgc agccagatga 600 atttcagaaa attctggact tcatgaagga tgtgatgaag aaactcagca acacttcgta 660 ccagtttgct gctgttcagt tttccacaag ctacaaaaca gaatttgatt tctcagatta 720 tgttaaatgg aaggaccctg atgctctgct gaagcatgta aagcacatgt tgctgttgac 780 caataccttt ggtgccatca attatgtcgc gacagaggtg ttccgggagg agctgggggc 840 ccggccagat gccaccaaag tgcttatcat catcacggat ggggaggcca ctgacagtgg 900 caacatcgat gcggccaaag acatcatccg ctacatcatc gggattggaa agcattttca 960 gaccaaggag agtcaggaga ccctccacaa atttgcatca aaacccgcga gcgagtttgt 1020 gaaaattctg gacacatttg agaagctgaa agatctattc actgagctgc agaagaagat 1080 ctatgtcatt gagggcacaa gcaaacagga cctgacttcc ttcaacatgg agctgtcctc 1140 cagcggcatc agtgctgacc tcagcagggg ccatgcagtc gtgggggcag taggagccaa 1200 ggactgggct gggggctttc ttgacctgaa ggcagacctg caggatgaca catttattgg 1260 gaatgaacca ttgacaccag aagtgagagc aggctatttg ggttacaccg tgacctggct 1320 gccctcccgg caaaagactt cgttgctggc ctcgggagcc cctcgatacc agcacatggg 1380 ccgagtgctg ctgttccaag agccacaggg cggaggacac tggagccagg tccagacaat 1440 ccatgggacc cagattggct cttatttcgg tggggagctg tgtggcgtcg acgtggacca 1500 agatggggag acagagctgc tgctgattgg tgccccactg ttctatgggg agcagagagg 1560 aggccgggtg tttatctacc agagaagaca gttggggttt gaagaagtct cagagctgca 1620 gggggacccc ggctacccac tcgggcggtt tggagaagcc atcactgctc tgacagacat 1680 caacggcgat gggctggtag acgtggctgt gggggcccct ctggaggagc agggggctgt 1740 gtacatcttc aatgggaggc acggggggct tagtccccag ccaagtcagc ggatagaagg 1800 gacccaagtg ctctcaggaa ttcagtggtt tggacgctcc atccatgggg tgaaggacct 1860 tgaaggggat ggcttggcag atgtggctgt gggggctgag agccagatga tcgtgctgag 1920 ctcccggccc gtggtggata tggtcaccct gatgtccttc tctccagctg agatcccagt 1980 gcatgaagtg gagtgctcct attcaaccag taacaagatg aaagaaggag ttaatatcac 2040 aatctgtttc cagatcaagt ctctctaccc ccagttccaa ggccgcctgg ttgccaatct 2100 cacttacact ctgcagctgg atggccaccg gaccagaaga cgggggttgt tcccaggagg 2160 gagacatgaa ctcagaagga atatagctgt caccaccagc atgtcatgca ctgacttctc 2220 atttcatttc ccggtatgtg ttcaagacct catctccccc atcaatgttt ccctgaattt 2280 ctctctttgg gaggaggaag ggacaccgag ggaccaaagg gcgcagggca aggacatacc 2340 gcccatcctg agaccctccc tgcactcgga aacctgggag atcccttttg agaagaactg 2400 tggggaggac aagaagtgtg aggcaaactt gagagtgtcc ttctctcctg caagatccag 2460 agccctgcgt ctaactgctt ttgccagcct ctctgtggag ctgagcctga gtaacttgga 2520 agaagatgct tactgggtcc agctggacct gcacttcccc ccgggactct ccttccgcaa 2580 ggtggagatg ctgaagcccc atagccagat acctgtgagc tgcgaggagc ttcctgaaga 2640 gtccaggctt ctgtccaggg cattatcttg caatgtgagc tctcccatct tcaaagcagg 2700 ccactcggtt gctctgcaga tgatgtttaa tacactggta aacagctcct ggggggactc 2760 ggttgaattg cacgccaatg tgacctgtaa caatgaggac tcagacctcc tggaggacaa 2820 ctcagccact accatcatcc ccatcctgta ccccatcaac atcctcatcc aggaccaaga 2880 agactccaca ctctatgtca gtttcacccc caaaggcccc aagatccacc aagtcaagca 2940 catgtaccag gtgaggatcc agccttccat ccacgaccac aacataccca ccctggaggc 3000 tgtggttggg gtgccacagc ctcccagcga ggggcccatc acacaccagt ggagcgtgca 3060 gatggagcct cccgtgccct gccactatga ggatctggag aggctcccgg atgcagctga 3120 gccttgtctc cccggagccc tgttccgctg ccctgttgtc ttcaggcagg agatcctcgt 3180 ccaagtgatc gggactctgg agctggtggg agagatcgag gcctcttcca tgttcagcct 3240 ctgcagctcc ctctccatct ccttcaacag cagcaagcat ttccacctct atggcagcaa 3300 cgcctccctg gcccaggttg tcatgaaggt tgacgtggtg tatgagaagc agatgctcta 3360 cctctacgtg ctgagcggca tcggggggct gctgctgctg ctgctcattt tcatagtgct 3420 gtacaaggtt ggtttcttca aacggaacct gaaggagaag atggaggctg gcagaggtgt 3480 cccgaatgga atccctgcag aagactctga gcagctggca tctgggcaag aggctgggga 3540 tcccggctgc ctgaagcccc tccatgagaa ggactctgag agtggtggtg gcaaggactg 3600 agtccaggcc tgtgaggtgc agagtgccca gaactggact caggatgccc agggccactc 3660 tgcctctgcc tgcattctgc cgtgtgccct cgggcgagtc actgcctctc cctggccctc 3720 agtttcccta tctcgaacat ggaactcatt cctgaatgtc tcctttgcag gctcataggg 3780 aagacctgct gagggaccag ccaagagggc tgcaaaagtg agggcttgtc attaccagac 3840 ggttcaccag cctctcttgg ttccttcctt ggaagagaat gtctgatcta aatgtggaga 3900 aactgtagtc tcaggaccta gggatgttct ggccctcacc cctgccctgg gatgtccaca 3960 gatgcctcca ccccccagaa cctgtccttg cacactcccc tgcactggag tccagtctct 4020 tctgctggca gaaagcaaat gtgacctgtg tcactacgtg actgtggcac acgccttgtt 4080 cttggccaaa gaccaaattc cttggcatgc cttccagcac cctgcaaaat gagaccctcg 4140 tggccttccc cagcctcttc tagagccgtg atgcctccct gttgaagctc tggtgacacc 4200 agcctttctc ccaggccagg ctccttcctg tcttcctgca ttcacccaga cagctccctc 4260 tgcctgaacc ttccatctcg cccacccctc cttccttgac cagcagatcc cagctcacgt 4320 cacacacttg gttgggtcct cacatctttc acacttccac caccctgcac tactccctca 4380 aagcacacgt catgtttctt catccggcag cctggatgtt ttttccctgt ttaatgattg 4440 acgtacttag cagctatctc tcagtgaact gtgagggtaa aggctatact tgtcttgttc 4500 accttgggat gacgccgcat gatatgtcag ggcgtgggac atctagtagg tgcttgacat 4560 aatttcactg aattaatgac agagccagtg ggaagataca gaaaaagagg gccggggctg 4620 ggcgcggtgg ttcacgcctg taatcccagc actttgggag gccaaggagg gtggatcacc 4680 tgaggtcagg agttagaggc cagcctggcg aaaccccatc tctactaaaa atacaaaatc 4740 caggcgtggt ggcacacacc tgtagtccca gctactcagg aggttgaggt aggagaattg 4800 cttgaacctg ggaggtggag gttgcagtga gccaagattg cgccattgca ctccagcctg 4860 ggcaacacag cgagactccg tctcaaggaa aaaataaaaa taaaaagcgg gcacgggccc 4920 ggacatcccc acccttggag gctgtcttct caggctctgc cctgccctag ctccacaccc 4980 tctcccagga cccatcacgc ctgtgcagtg gcccccacag aaagactgag ctcaaggtgg 5040 gaaccacgtc tgctaacttg gagccccagt gccaagcaca gtgcctgcat gtatttatcc 5100 aataaatgtg aaattctgtc caaaaaaaaa aaa 5133 18 4740 DNA Homo sapiens 18 gaattccgtg gttcctcagt ggtgcctgca acccctggtt cacctccttc caggttctgg 60 ctccttccag ccatggctct cagagtcctt ctgttaacag ccttgacctt atgtcatggg 120 ttcaacttgg acactgaaaa cgcaatgacc ttccaagaga acgcaagggg cttcgggcag 180 agcgtggtcc agcttcaggg atccagggtg gtggttggag ccccccagga gatagtggct 240 gccaaccaaa ggggcagcct ctaccagtgc gactacagca caggctcatg cgagcccatc 300 cgcctgcagg tccccgtgga ggccgtgaac atgtccctgg gcctgtccct ggcagccacc 360 accagccccc ctcagctgct ggcctgtggt cccaccgtgc accagacttg cagtgagaac 420 acgtatgtga aagggctctg cttcctgttt ggatccaacc tacggcagca gccccagaag 480 ttcccagagg ccctccgagg gtgtcctcaa gaggatagtg acattgcctt cttgattgat 540 ggctctggta gcatcatccc acatgacttt cggcggatga aggagtttgt ctcaactgtg 600 atggagcaat taaaaaagtc caaaaccttg ttctctttga tgcagtactc tgaagaattc 660 cggattcact ttaccttcaa agagttccag aacaacccta acccaagatc actggtgaag 720 ccaataacgc agctgcttgg gcggacacac acggccacgg gcatccgcaa agtggtacga 780 gagctgttta acatcaccaa cggagcccga aagaatgcct ttaagatcct agttgtcatc 840 acggatggag aaaagtttgg cgatcccttg ggatatgagg atgtcatccc tgaggcagac 900 agagagggag tcattcgcta cgtcattggg gtgggagatg ccttccgcag tgagaaatcc 960 cgccaagagc ttaataccat cgcatccaag ccgcctcgtg atcacgtgtt ccaggtgaat 1020 aactttgagg ctctgaagac cattcagaac cagcttcggg agaagatctt tgcgatcgag 1080 ggtactcaga caggaagtag cagctccttt gagcatgaga tgtctcagga aggcttcagc 1140 gctgccatca cctctaatgg ccccttgctg agcactgtgg ggagctatga ctgggctggt 1200 ggagtctttc tatatacatc aaaggagaaa agcaccttca tcaacatgac cagagtggat 1260 tcagacatga atgatgctta cttgggttat gctgccgcca tcatcttacg gaaccgggtg 1320 caaagcctgg ttctgggggc acctcgatat cagcacatcg gcctggtagc gatgttcagg 1380 cagaacactg gcatgtggga gtccaacgct aatgtcaagg gcacccagat cggcgcctac 1440 ttcggggcct ccctctgctc cgtggacgtg gacagcaacg gcagcaccga cctggtcctc 1500 atcggggccc cccattacta cgagcagacc cgagggggcc aggtgtccgt gtgccccttg 1560 cccagggggc agagggctcg gtggcagtgt gatgctgttc tctacgggga gcagggccaa 1620 ccctggggcc gctttggggc agccctaaca gtgctggggg acgtaaatgg ggacaagctg 1680 acggacgtgg ccattggggc cccaggagag gaggacaacc ggggtgctgt ttacctgttt 1740 cacggaacct caggatctgg catcagcccc tcccatagcc agcggatagc aggctccaag 1800 ctctctccca ggctccagta ttttggtcag tcactgagtg ggggccagga cctcacaatg 1860 gatggactgg tagacctgac tgtaggagcc caggggcacg tgctgctgct caggtcccag 1920 ccagtactga gagtcaaggc aatcatggag ttcaatccca gggaagtggc aaggaatgta 1980 tttgagtgta atgatcaggt ggtgaaaggc aaggaagccg gagaggtcag agtctgcctc 2040 catgtccaga agagcacacg ggatcggcta agagaaggac agatccagag tgttgtgact 2100 tatgacctgg ctctggactc cggccgccca cattcccgcg ccgtcttcaa tgagacaaag 2160 aacagcacac gcagacagac acaggtcttg gggctgaccc agacttgtga gaccctgaaa 2220 ctacagttgc cgaattgcat cgaggaccca gtgagcccca ttgtgctgcg cctgaacttc 2280 tctctggtgg gaacgccatt gtctgctttc gggaacctcc ggccagtgct ggcggaggat 2340 gctcagagac tcttcacagc cttgtttccc tttgagaaga attgtggcaa tgacaacatc 2400 tgccaggatg acctcagcat caccttcagt ttcatgagcc tggactgcct cgtggtgggt 2460 gggccccggg agttcaacgt gacagtgact gtgagaaatg atggtgagga ctcctacagg 2520 acacaggtca ccttcttctt cccgcttgac ctgtcctacc ggaaggtgtc cacactccag 2580 aaccagcgct cacagcgatc ctggcgcctg gcctgtgagt ctgcctcctc caccgaagtg 2640 tctggggcct tgaagagcac cagctgcagc ataaaccacc ccatcttccc ggaaaactca 2700 gaggtcacct ttaatatcac gtttgatgta gactctaagg cttcccttgg aaacaaactg 2760 ctcctcaagg ccaatgtgac cagtgagaac aacatgccca gaaccaacaa aaccgaattc 2820 caactggagc tgccggtgaa atatgctgtc tacatggtgg tcaccagcca tggggtctcc 2880 actaaatatc tcaacttcac ggcctcagag aataccagtc gggtcatgca gcatcaatat 2940 caggtcagca acctggggca gaggagcctc cccatcagcc tggtgttctt ggtgcccgtc 3000 cggctgaacc agactgtcat atgggaccgc ccccaggtca ccttctccga gaacctctcg 3060 agtacgtgcc acaccaagga gcgcttgccc tctcactccg actttctggc tgagcttcgg 3120 aaggcccccg tggtgaactg ctccatcgct gtctgccaga gaatccagtg tgacatcccg 3180 ttctttggca tccaggaaga attcaatgct accctcaaag gcaacctctc gtttgactgg 3240 tacatcaaga cctcgcataa ccacctcctg atcgtgagca cagctgagat cttgtttaac 3300 gattccgtgt tcaccctgct gccgggacag ggggcgtttg tgaggtccca gacggagacc 3360 aaagtggagc cgttcgaggt ccccaacccc ctgccgctca tcgtgggcag ctctgtcggg 3420 ggactgctgc tcctggccct catcaccgcc gcgctgtaca agctcggctt cttcaagcgg 3480 caatacaagg acatgatgag tgaagggggt cccccggggg ccgaacccca gtagcggctc 3540 cttcccgaca gagctgcctc tcggtggcca gcaggactct gcccagacca cacgtagccc 3600 ccaggctgct ggacacgtcg gacagcgaag tatccccgac aggacgggct tgggcttcca 3660 tttgtgtgtg tgcaagtgtg tatgtgcgtg tgtgcgagtg tgtgcaagtg tctgtgtgca 3720 agtgtgtgca cgtgtgcgtg tgcgtgcatg tgcactcgca cgcccatgtg tgagtgtgtg 3780 caagtatgtg agtgtgtcca gtgtgtgtgc gtgtgtccat gtgtgtgcag tgtgtgcatg 3840 tgtgcgagtg tgtgcatgtg tgtgctcagg ggctgtggct cacgtgtgtg actcagagtg 3900 tctctggcgt gtgggtaggt gacggcagcg tagcctctcc ggcagaaggg aactgcctgg 3960 gctcccttgt gcgtgggtaa gccgctgctg ggttttcctc cgggagaggg gacggtcaat 4020 cctgtgggtg aagagagagg gaaacacagc agcatctctc cactgaaaga agtgggactt 4080 cccgtcgcct gcgagcctgc ggcctgctgg agcctgcgca gcttggatgg atactccatg 4140 agaaaagccg tgggtggaac caggagcctc ctccacacca gcgctgatgc ccaataaaga 4200 tgcccactga ggaatcatga agcttccttt ctggattcat ttattatttc aatgtgactt 4260 taattttttg gatggataag cctgtctatg gtacaaaaat cacaaggcat tcaagtgtac 4320 agtgaaaagt ctccctttcc agatattcaa gtcacctcct taaaggtagt caagattgtg 4380 ttttgaggtt tccttcagac agattccagg cgatgtgcaa gtgtatgcac gtgtgcacac 4440 accacacaca tacacacaca caagcttttt tacacaaatg gtagcatact ttatattggt 4500 ctgtatcttg ctttttttca ccaatatttc tcagacatcg gttcatatta agacataaat 4560 tactttttca ttcttttata ccgctgcata gtattccatt gtgtgagtgt accataatgt 4620 atttaaccag tcttcttttg atatactatt ttcatctctt gttattgcat ctgctgagtt 4680 aataaatcaa atatatgtca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4740 

We claim:
 1. A non-viral vector comprising: a vesicular membrane with hepatitis B envelope (env) protein containing a cardiac targeting sequence exposed on the surface of the vesicle and a nucleic acid construct comprising a nucleotide sequence for cardiovascular gene therapy and a promoter sequence functional in cardiovascular cells.
 2. The vesicle vector of claim 1, wherein the env protein contains mutations to reduce antigenicity.
 3. The vesicle vector of claim 1, wherein the expression construct is DNA.
 4. The vesicle vector of claim 1, wherein the expression construct is double stranded plasmid DNA.
 5. The vesicle vector of claim 1, wherein the expression construct is RNA.
 6. The vesicle vector of claim 1, wherein the promoter is a non-tissue specific promoter.
 7. The vesicle vector of claim 6, wherein the viral promoter is selected from the group consisting of cytomegalovirus promoter, Rous sarcoma virus promoter ubiquitin promoter, chicken β-actin promoter and elongation factor 1α promoter.
 8. The vesicle vector of claim 1, wherein the promoter is a cardiomyocyte specific promoter.
 9. The vesicle vector of claim 8, wherein the cardiomyocyte specific promoter is selected from the group consisting of myosin light chain 2v promoter, cardiac ankyrin repeat protein (CARP) promoter, ANF promoter and BNP promoter.
 10. The vesicle vector of claim 1, wherein the promoter is a smooth muscle cell specific promoter.
 11. The vesicle vector of claim 10, where the smooth muscle cell specific promoter is SM22 promoter.
 12. The vesicle vector of claim 11, wherein the promoter is an endothelial cell specific promoter.
 13. The vesicle vector of claim 12, wherein the endothelial cell specific promoter is selected from the group consisting of Flt-1 promoter, Flk-1 promoter, endothelial type nitric oxide synthase promoter and endothelin promoter.
 14. The vesicle vector of claim 1, wherein the expression construct comprises inverted terminal repeat sequences from adeno-associated virus (AAV-ITR).
 15. The vesicle vector of claim 1, wherein the expression construct comprises eukaryotic transposon and transposase elements.
 16. The vesicle vector of claim 1, wherein the cardiovascular targeting sequence comprises a viral protein sequence.
 17. The vesicle vector of claim 1, wherein the cardiovascular targeting sequence comprises a natural ligand for a receptor on cardiovascular cells.
 18. A non-viral vesicle vector comprising: a vesicular membrane with hepatitis B env protein exposed on the vesicle surface and a protein for treatment of cardiovascular disease.
 19. The vesicle vector of claim 18, wherein the env protein contains mutations to reduce antigenicity.
 20. A method for treatment of cardiac disease comprising: intravenous administration to an individual with cardiac disease a non-viral vesicle vector comprising a vesicular membrane with hepatitis B env protein with a cardiac targeting sequence exposed on the vesicle surface and a nucleic acid construct comprising a nucleotide sequence for cardiac gene therapy and a promoter sequence functional in cardiac cells monitoring the individual for amelioration of disease. 